COMPOSITIONS AND METHODS
RELATING TO LUNG SPECIFIC GENES 

INTRODUCTION 

This application claims the benefit of priority from
U.S. Provisional Application Serial No. 60/228,378, filed
August 28, 2000, which is herein incorporated in its
entirety.

FIELD OF THE INVENTION 

The present invention relates to newly identified
nucleic acids and polypeptides present in normal and
neoplastic lung cells, including fragments, variants and
derivatives of the nucleic acids and polypeptides. The
present invention also relates to antibodies to the
polypeptides of the invention, as well as agonists and
antagonists of the polypeptides of the invention. The
invention also relates to compositions comprising the
nucleic acids, polypeptides, antibodies, variants,
derivatives, agonists and antagonists of the invention and
methods for the use of these compositions. These uses
include identifying, diagnosing, monitoring, staging,
imaging and treating lung cancer and non-cancerous disease
states in lung, identifying lung tissue, monitoring and
modifying lung embryonic development and differentiation,
and identifying and/or designing agonists and antagonists
of polypeptides of the invention. The uses also include
gene therapy, production of transgenic animals and cells,
and production of engineered lung tissue for treatment and
research.
BACKGROUND OF THE INVENTION

Throughout the last hundred years, the incidence of
lung cancer has steadily increased, so much so that now in
many countries, it is the most common cancer. In fact,
lung cancer is the second most prevalent type of cancer for
both men and women in the United States and is the most
common cause of cancer death in both sexes. Lung cancer
deaths have increased ten-fold in both men and women since
1930, primarily due to an increase in cigarette smoking,
but also due to an increased exposure to arsenic, asbestos,
chromates, chloromethyl ethers, nickel, polycyclic aromatic
hydrocarbons and other agents. See Scott, Lung Cancer: A
Guide to Diagnosis and Treatment, Addicus Books (2000) and
Alberg et al., in Kane et al. (eds.) Biology of Lung
Cancer, pp. 11-52, Marcel Dekker, Inc. (1998). Lung cancer
may result from a primary tumor originating in the lung or
a secondary tumor which has spread from another organ such
as the bowel or breast. Although there are over a dozen
types of lung cancer, over 90% fall into two categories:
small cell lung cancer (SCLC) and non-small cell lung
cancer (NSCLC). See Scott, supra. About 20-25% of all
lung cancers are characterized as SCLC, while 70-80% are
diagnosed as NSCLC. Id. A rare type of lung cancer is
mesothelioma, which is generally caused by exposure to
asbestos, and which affects the pleura of the lung. Lung
cancer is usually diagnosed or screened for by chest x-ray,
CAT scans, PET scans, or by sputum cytology. A diagnosis
of lung cancer is usually confirmed by biopsy of the
tissue. Id.

SCLC tumors are highly metastatic and grow
quickly. By the time a patient has been diagnosed with
SCLC, the cancer has usually already spread to other parts
of the body, including lymph nodes, adrenals, liver, bone,
brain and bone marrow. See Scott, supra; Van Houtte et al.
(eds.), Progress and Perspective in the Treatment of Lung



WO 02/18576 



PCT/US01/26684




Cancer, Springer-Verlag (1999). Because the disease has
usually spread to such an extent that surgery is not an
option, the current treatment of choice is chemotherapy
plus chest irradiation. See Van Houtte, supra. The stage
of disease is a principal predictor of long-term survival.
Fewer than 5% of patients with extensive disease that has
spread beyond one lung and surrounding lymph nodes live
longer than two years. Id. However, the probability of
five-year survival is three to four times higher if the
disease is diagnosed and treated when it is still in a
limited stage, i.e., not having spread beyond one lung.
Id.

NSCLC is generally divided into three types:
squamous cell carcinoma, adenocarcinoma and large cell
carcinoma. Both squamous cell cancer and adenocarcinoma
develop from the cells that line the airways; however,
adenocarcinoma develops from the goblet cells that produce
mucus. Large cell lung cancer has been thus named because
the cells look large and rounded when viewed
microscopically, and generally are considered relatively
undifferentiated. See Yesner, Atlas of Lung Cancer,
Lippincott-Raven (1998).

Secondary lung cancer is a cancer initiated elsewhere
in the body that has spread to the lungs. Cancers that
metastasize to the lung include, but are not limited to,
breast cancer, melanoma, colon cancer and Hodgkin's
lymphoma. Treatment for secondary lung cancer may depend
upon the source of the original cancer. In other words, a
lung cancer that originated from breast cancer may be more
responsive to breast cancer treatments and a lung cancer
that originated from colon cancer may be more
responsive to colon cancer treatments.

The stage of a cancer indicates how far it has spread
and is an important indicator of the prognosis. In
addition, staging is important because treatment is often
decided according to the stage of a cancer. SCLC is
divided into two stages: limited disease, i.e., cancer
that can only be seen in one lung and in nearby lymph
nodes; and extensive disease, i.e., cancer that has spread
outside the lung to the chest or to other parts of the
body. For most patients with SCLC, the disease has already
progressed to lymph nodes or elsewhere in the body at the
time of diagnosis. See Scott, supra. Even if spreading is
not apparent on the scans, it is likely that some cancer
cells have broken away and traveled through the
bloodstream or lymph system. In general, chemotherapy with
or without radiotherapy is often the preferred treatment.
The scans and tests done initially will be used
later to see how well a patient is responding to treatment.

In contrast, non-small cell cancer may be divided
into four stages. Stage I is highly localized cancer with
no cancer in the lymph nodes. Stage II cancer has spread
to the lymph nodes at the top of the affected lung. Stage
III cancer has spread near to where the cancer started.
This can be to the chest wall, the covering of the lung
(pleura), the middle of the chest (mediastinum) or other
lymph nodes. Stage IV cancer has spread to another part of
the body. Stage I-III cancer is usually treated with
surgery, with or without chemotherapy. Stage IV cancer is
usually treated with chemotherapy and/or palliative care.

A number of chromosomal and genetic abnormalities
have been observed in lung cancer. In NSCLC, chromosomal
aberrations have been described on 3p, 9p, 11p, 15p and
17p, and chromosomal deletions have been seen on
chromosomes 7, 11, 13 and 19. See Skarin (ed.),
Multimodality Treatment of Lung Cancer, Marcel Dekker, Inc.
(2000); Gemmill et al., pp. 465-502, in Kane, supra;
Bailey-Wilson et al., pp. 53-98, in Kane, supra.
Chromosomal abnormalities have been described on 1p, 3p,
5q, 6q, 8q, 13q and 17p in SCLC. Id. In addition, the
loss of the short arm of chromosome 3 (3p) has also been seen
in greater than 90% of SCLC tumors and approximately 50% of
NSCLC tumors. Id.

A number of oncogenes and tumor suppressor genes have
been implicated in lung cancer. See Mabry, pp. 391-412, in
Kane, supra and Sclafani et al., pp. 295-316, in Kane,
supra. In both SCLC and NSCLC, the p53 tumor suppressor
gene is mutated in over 50% of lung cancers. See Yesner,
supra. Another tumor suppressor gene, FHIT, which is found
on chromosome 3p, is mutated by tobacco smoke. Id.;
Skarin, supra. In addition, more than 95% of SCLCs and
approximately 20-60% of NSCLCs have an absent or abnormal
retinoblastoma (Rb) protein, the product of another tumor
suppressor gene.
The ras oncogene (particularly K-ras) is mutated in 20-30%
of NSCLC specimens and the c-erbB2 oncogene is expressed in
18% of stage 2 NSCLC and 60% of stage 4 NSCLC specimens.
See Van Houtte, supra. Other tumor suppressor genes that
are found in a region of chromosome 9, specifically in the
region of 9p21, are deleted in many cancer cells, including
p16INK4A and p15INK4B. See Bailey-Wilson, supra; Sclafani et
al., supra. These tumor suppressor genes may also be
implicated in lung cancer pathogenesis.

In addition, many lung cancer cells produce growth
factors that may act in an autocrine fashion on lung cancer
cells. See Siegfried et al., pp. 317-336, in Kane, supra;
Moody, pp. 337-370, in Kane, supra and Heasley et al., pp.
371-390, in Kane, supra. In SCLC, many tumor cells produce
gastrin-releasing peptide (GRP), which is a proliferative
growth factor for these cells. See Skarin, supra. Many
NSCLC tumors express epidermal growth factor (EGF)
receptors, allowing NSCLC cells to proliferate in response
to EGF. Insulin-like growth factor (IGF-I) is elevated in
greater than 95% of SCLC and greater than 80% of NSCLC
tumors; it is thought to function as an autocrine growth
factor. Id. Finally, stem cell factor (SCF, also known as
steel factor or kit ligand) and c-Kit (a proto-oncoprotein
tyrosine kinase receptor for SCF) are both expressed at
high levels in SCLC, and thus may form an autocrine loop
that increases proliferation. Id.

Although the majority of lung cancer cases are
attributable to cigarette smoking, most smokers do not
develop lung cancer. Epidemiological evidence has
suggested that susceptibility to lung cancer may be
inherited in a Mendelian fashion, and thus have an
inherited genetic component. Bailey-Wilson, supra. Thus,
it is thought that certain allelic variants at some genetic
loci may affect susceptibility to lung cancer. Id. One way
to identify which allelic variants are likely to be
involved in lung cancer susceptibility, as well as
susceptibility to other diseases, is to look at allelic
variants of genes that are highly expressed in lung.

The lung is also susceptible to a number of other
debilitating diseases, including, without limitation,
emphysema, pneumonia, cystic fibrosis and asthma. See
Stockley (ed.), Molecular Biology of the Lung, Volume I:
Emphysema and Infection, Birkhauser Verlag (1999),
hereafter Stockley I, and Stockley (ed.), Molecular Biology
of the Lung, Volume II: Asthma and Cancer, Birkhauser
Verlag (1999), hereafter Stockley II. The cause of many
of these disorders is still not well understood and there are
few, if any, good treatment options for many of these
noncancerous lung disorders. Thus, there remains a need to
understand various noncancerous lung disorders and to
identify treatments for these diseases.

In yet another aspect, the development and
differentiation of the lung tissue is important during
embryonic development. All of the epithelial cells of the
respiratory tract, including those of the lung and bronchi,
are derived from the primitive endodermal cells that line
the embryonic outpouching. See Yesner, supra. During
embryonic development, multipotent endodermal stem cells
differentiate into many different types of specialized
cells, which include ciliated cells for moving inhaled
particles, goblet cells for producing mucus, Kulchitsky's
cells for endocrine function, and Clara cells and type II
pneumocytes for secreting surfactant protein. Id.
Improper development and differentiation may cause
respiratory disorders and distress in infants, particularly
in premature infants, whose lungs cannot produce sufficient
surfactant when they are born. Further, some lung cancer
cells, particularly small cell carcinomas, appear
multipotent, and can spontaneously differentiate into a
number of cell types, including small cell carcinoma,
adenocarcinoma and squamous cell carcinoma. Id. Thus, a
better understanding of lung development and
differentiation may help facilitate understanding of lung
cancer initiation and progression.

Accordingly, there is a great need for more sensitive
and accurate methods for predicting whether a person is
likely to develop lung cancer, for diagnosing lung cancer,
for monitoring the progression of the disease, for staging
the lung cancer, for determining whether the lung cancer
has metastasized and for imaging the lung cancer. There is
also a need for better treatment of lung cancer. Further,
there is also a great need for diagnosing and treating
noncancerous lung disorders such as emphysema, pneumonia,
lung infection, pulmonary fibrosis, cystic fibrosis and
asthma. There is also a need for compositions and methods
of using them that can be used to identify lung tissue for
forensic purposes and for determining whether a particular
cell or tissue exhibits lung-specific characteristics.

In the present invention, methods are provided for
detecting, diagnosing, monitoring, staging,
prognosticating, imaging and treating lung cancer via lung
specific genes referred to herein as LSGs. For purposes of
the present invention, LSG refers, among other things, to
native protein expressed by the gene comprising a
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74,
respectively. By "LSG" it is also meant herein
polynucleotides which, due to degeneracy in genetic coding,
comprise variations in nucleotide sequence as compared to
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20, or 74 but which still encode the
same polypeptide. Exemplary amino acid sequences for LSG
polypeptides are set forth in SEQ ID NO: 75, 76, 77, 78,
79, 80, 81, 82, 83 and 84. In the alternative, LSG as used
herein means the native mRNA encoded
by the gene comprising the polynucleotide sequence of SEQ
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19, 20 or 74, levels of the gene comprising the
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74 or
levels of a polynucleotide which is capable of hybridizing
under stringent conditions to the antisense sequence of SEQ
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19, 20, or 74.
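
The two sequence relationships named in this definition, degeneracy of
the genetic code and the antisense strand, can be illustrated with a
minimal sketch. The two short DNA strings below are hypothetical and are
not taken from any SEQ ID NO of this disclosure; the codon table is only
the small subset of the standard genetic code that the illustration
needs, and the function names are assumptions made for the example.

```python
# Subset of the standard genetic code (coding-strand DNA codons),
# limited to the codons used in this illustration.
CODON_TABLE = {
    "ATG": "M",
    "GCT": "A", "GCC": "A",
    "CTG": "L", "TTA": "L",
    "AAA": "K", "AAG": "K",
    "TAA": "*",
}

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def translate(dna):
    """Translate a coding-strand DNA string, reading frame 0, codon by codon."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


def antisense(dna):
    """Return the reverse complement (antisense strand) of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(dna))


# Hypothetical sequences for illustration only; not taken from any SEQ ID NO.
reference = "ATGGCTCTGAAATAA"
degenerate = "ATGGCCTTAAAGTAA"  # differs from the reference at three codons

# Degenerate variants differ in nucleotide sequence but encode the same polypeptide.
same_polypeptide = translate(reference) == translate(degenerate)

# The strand to which a sense-strand polynucleotide would hybridize.
probe_target = antisense(reference)
```

The sketch ignores hybridization stringency entirely; it only shows
which strand is meant by "antisense" and why two different nucleotide
sequences can still encode the same polypeptide.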

Other objects, features, advantages and aspects of
the present invention will become apparent to those of
skill in the art from the following description. It should
be understood, however, that the following description and
the specific examples, while indicating preferred
embodiments of the invention, are given by way of
illustration only. Various changes and modifications
within the spirit and scope of the disclosed invention will
become readily apparent to those skilled in the art from
reading the following description and from reading the
other parts of the present disclosure.
SUMMARY OF THE INVENTION

Toward these ends, and others, it is an object of the
present invention to provide LSGs comprising a
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74, a protein
expressed by a polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5,
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or
74, or a variant thereof which expresses the protein; or a
polynucleotide which is capable of hybridizing under
stringent conditions to the antisense sequence of SEQ ID
NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19, 20, or 74. Exemplary LSG polypeptides of the
present invention are depicted in SEQ ID NO: 75, 76, 77,
78, 79, 80, 81, 82, 83 or 84.

It is another object of the present invention to
provide a method for diagnosing the presence of lung cancer
by analyzing for changes in levels of LSG in cells, tissues
or bodily fluids compared with levels of LSG in preferably
the same cells, tissues, or bodily fluid type of a normal
human control, wherein a change in levels of LSG in the
patient versus the normal human control is associated with
lung cancer.
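
The comparison of patient and control levels described above can be
sketched as a simple fold-change calculation. This is a minimal
illustration under stated assumptions, not a method taken from this
disclosure: the function names, the use of the control mean as the
baseline, and the two-fold threshold are all hypothetical choices made
for the example, and the text itself specifies no numerical cut-off.

```python
from statistics import mean


def fold_change(patient_level, control_levels):
    """Ratio of the patient's LSG level to the mean level in normal controls."""
    baseline = mean(control_levels)
    if baseline <= 0:
        raise ValueError("control baseline must be positive")
    return patient_level / baseline


def classify_level(patient_level, control_levels, threshold=2.0):
    """Label the patient level relative to controls.

    The threshold is an assumed, illustrative cut-off (hypothetical).
    """
    ratio = fold_change(patient_level, control_levels)
    if ratio >= threshold:
        return "increased"
    if ratio <= 1.0 / threshold:
        return "decreased"
    return "unchanged"
```

For example, with control measurements of 1.0, 2.0 and 3.0 (arbitrary
units, same sample type), a patient level of 8.0 would be labeled
"increased" under the assumed two-fold threshold, while a level of 2.0
would be labeled "unchanged".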

Further provided is a method of diagnosing metastatic
lung cancer in a patient having lung cancer which is not
known to have metastasized by identifying a human patient
suspected of having lung cancer that has metastasized;
analyzing a sample of cells, tissues, or bodily fluid from
such patient for LSG; comparing the LSG levels in such
cells, tissues, or bodily fluid with levels of LSG in
preferably the same cells, tissues, or bodily fluid type of
a normal human control, wherein an increase in LSG levels
in the patient versus the normal human control is
associated with lung cancer which has metastasized.

Also provided by the invention is a method of staging
lung cancer in a human which has such cancer by identifying
a human patient having such cancer; analyzing a sample of
cells, tissues, or bodily fluid from such patient for LSG;
comparing LSG levels in such cells, tissues, or bodily
fluid with levels of LSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control
sample, wherein an increase in LSG levels in the patient
versus the normal human control is associated with a cancer
which is progressing and a decrease in the levels of LSG is
associated with a cancer which is regressing or in
remission.

Further provided is a method of monitoring lung
cancer in a human having such cancer for the onset of
metastasis. The method comprises identifying a human
patient having such cancer that is not known to have
metastasized; periodically analyzing a sample of cells,
tissues, or bodily fluid from such patient for LSG;
comparing the LSG levels in such cells, tissues, or bodily
fluid with levels of LSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control
sample, wherein an increase in LSG levels in the patient
versus the normal human control is associated with a cancer
which has metastasized.

Further provided is a method of monitoring the change
in stage of lung cancer in a human having such cancer by
looking at levels of LSG in a human having such cancer.
The method comprises identifying a human patient having
such cancer; periodically analyzing a sample of cells,
tissues, or bodily fluid from such patient for LSG;
comparing the LSG levels in such cells, tissues, or bodily
fluid with levels of LSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control
sample, wherein an increase in LSG levels in the patient
versus the normal human control is associated with a cancer
which is progressing and a decrease in the levels of LSG is
associated with a cancer which is regressing or in
remission.

Further provided are methods of designing new
therapeutic agents targeted to an LSG for use in imaging and
treating lung cancer. For example, in one embodiment,
therapeutic agents such as antibodies targeted against LSG
or fragments of such antibodies can be used to treat,
detect or image localization of LSG in a patient for the
purpose of detecting or diagnosing a disease or condition.
In this embodiment, an increase in the amount of labeled
antibody detected as compared to normal tissue would be
indicative of tumor metastases or growth. Such antibodies
can be polyclonal, monoclonal, or omniclonal or prepared by
molecular biology techniques. The term "antibody", as used
herein and throughout the instant specification, is also
meant to include aptamers and single-stranded
oligonucleotides such as those derived from an in vitro
evolution protocol referred to as SELEX and well known to
those skilled in the art. Antibodies can be labeled with a
variety of detectable and therapeutic labels including, but
not limited to, radioisotopes and paramagnetic metals.
Therapeutic agents such as small molecules and antibodies
which decrease the concentration and/or activity of LSG can
also be used in the treatment of diseases characterized by
overexpression of LSG. Such agents can be readily
identified in accordance with teachings herein.

Other objects, features, advantages and aspects of
the present invention will become apparent to those of
skill in the art from the following description. It should
be understood, however, that the following description and
the specific examples, while indicating preferred
embodiments of the invention, are given by way of
illustration only. Various changes and modifications
within the spirit and scope of the disclosed invention will
become readily apparent to those skilled in the art from
reading the following description and from reading the
other parts of the present disclosure.



GLOSSARY 

The following illustrative explanations are provided
to facilitate understanding of certain terms used
frequently herein, particularly in the examples. The
explanations are provided as a convenience and are not
limitative of the invention.

ISOLATED means altered "by the hand of man" from its
natural state; i.e., that, if it occurs in nature, it has
been changed or removed from its original environment, or
both.

For example, a naturally occurring polynucleotide or
a polypeptide naturally present in a living animal in its
natural state is not "isolated," but the same
polynucleotide or polypeptide separated from the coexisting
materials of its natural state is "isolated," as the term
is employed herein. For example, with respect to
polynucleotides, the term isolated means that it is
separated from the chromosome and cell in which it
naturally occurs.

As part of or following isolation, such
polynucleotides can be joined to other polynucleotides,
such as DNAs, for mutagenesis, to form fusion proteins, and
for propagation or expression in a host, for instance. The
isolated polynucleotides, alone or joined to other
polynucleotides such as vectors, can be introduced into
host cells, in culture or in whole organisms. When
introduced into host cells in culture or in whole
organisms, such DNAs still would be isolated, as the term
is used herein, because they would not be in their
naturally occurring form or environment. Similarly, the
polynucleotides and polypeptides may occur in a
composition, such as media formulations, solutions for
introduction of polynucleotides or polypeptides, for
example, into cells, compositions or solutions for chemical
or enzymatic reactions, for instance, which are not
naturally occurring compositions, and therein remain
isolated polynucleotides or polypeptides within the meaning
of that term as it is employed herein.

OLIGONUCLEOTIDE(S) refers to relatively short
polynucleotides. Often the term refers to single-stranded
deoxyribonucleotides, but it can refer as well to single- or
double-stranded ribonucleotides, RNA:DNA hybrids and
double-stranded DNAs, among others.

Oligonucleotides, such as single-stranded DNA probe
oligonucleotides, often are synthesized by chemical
methods, such as those implemented on automated
oligonucleotide synthesizers. However, oligonucleotides
can be made by a variety of other methods, including in
vitro recombinant DNA-mediated techniques and by expression
of DNAs in cells and organisms.

Initially, chemically synthesized DNAs typically are
obtained without a 5' phosphate. The 5' ends of such
oligonucleotides are not substrates for phosphodiester bond
formation by ligation reactions that employ DNA ligases
typically used to form recombinant DNA molecules. Where
ligation of such oligonucleotides is desired, a phosphate
can be added by standard techniques, such as those that
employ a kinase and ATP.

The 3' end of a chemically synthesized
oligonucleotide generally has a free hydroxyl group and, in
the presence of a ligase such as T4 DNA ligase, readily
will form a phosphodiester bond with a 5' phosphate of
another polynucleotide, such as another oligonucleotide.
As is well known, this reaction can be prevented
selectively, where desired, by removing the 5' phosphates
of the other polynucleotide(s) prior to ligation.
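
The end-chemistry rules in the preceding paragraphs can be summarized in
a small model. This is a sketch for illustration only: the class and
function names are hypothetical, and the model reduces ligation to the
single requirement stated above, namely a free 3' hydroxyl on one
molecule and a 5' phosphate on the other.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Oligo:
    """Hypothetical record of an oligonucleotide and its end groups."""
    sequence: str
    five_prime_phosphate: bool = False  # chemically synthesized DNA typically lacks it
    three_prime_hydroxyl: bool = True   # synthesized oligos generally have a free 3' OH


def kinase_treat(oligo):
    """Model addition of a 5' phosphate (e.g., by a kinase and ATP)."""
    return replace(oligo, five_prime_phosphate=True)


def dephosphorylate(oligo):
    """Model removal of the 5' phosphate prior to ligation."""
    return replace(oligo, five_prime_phosphate=False)


def can_ligate(upstream, downstream):
    """True if the upstream 3' OH can be joined to the downstream 5' phosphate."""
    return upstream.three_prime_hydroxyl and downstream.five_prime_phosphate


# Hypothetical sequences; freshly synthesized, so neither carries a 5' phosphate.
a = Oligo("ACGTACGT")
b = Oligo("TTGACCA")

before = can_ligate(a, b)                               # no 5' phosphate on b
after = can_ligate(a, kinase_treat(b))                  # phosphate added to b
blocked = can_ligate(a, dephosphorylate(kinase_treat(b)))  # phosphate removed again
```

In this model `before` and `blocked` are false and `after` is true,
mirroring the text: ligation requires the 5' phosphate, and removing it
selectively prevents the reaction.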
POLYNUCLEOTIDE(S) generally refers to any
polyribonucleotide or polydeoxyribonucleotide and is
inclusive of unmodified RNA or DNA as well as modified RNA
or DNA. Thus, for instance, polynucleotides as used herein
refers to, among other things, single- and double-stranded
DNA, DNA that is a mixture of single- and double-stranded
regions, single- and double-stranded RNA, and RNA that is a
mixture of single- and double-stranded regions, hybrid
molecules comprising DNA and RNA that may be single-
stranded or, more typically, double-stranded or a mixture
of single- and double-stranded regions. In addition,
polynucleotide, as used herein, refers to triple-stranded
regions comprising RNA or DNA or both RNA and DNA. The
strands in such regions may be from the same molecule or
from different molecules. The regions may include all of
one or more of the molecules, but more typically involve
only a region of some of the molecules. One of the
molecules of a triple-helical region often is an
oligonucleotide.

As used herein, the term polynucleotide is also
inclusive of DNAs or RNAs as described above that contain
one or more modified bases. Thus, DNAs or RNAs with
backbones modified for stability or for other reasons are
"polynucleotides" as that term is intended herein.
Moreover, DNAs or RNAs comprising unusual bases, such as
inosine, or modified bases, such as tritylated bases, to
name just two examples, are polynucleotides as the term is
used herein.

It will be appreciated that a great variety of
modifications have been made to DNA and RNA that serve many
useful purposes known to those of skill in the art. The
term polynucleotide as it is employed herein embraces such
chemically, enzymatically or metabolically modified forms
of polynucleotides, as well as chemical forms of DNA and
RNA characteristic of viruses and cells, including simple
and complex cells, inter alia.

POLYPEPTIDES, as used herein, includes all
polypeptides as described below. The basic structure of
polypeptides is well known and has been described in
innumerable textbooks and other publications in the art.
In this context, the term is used herein to refer to any
peptide or protein comprising two or more amino acids
joined to each other in a linear chain by peptide bonds.
As used herein, the term refers to both short chains, which
also commonly are referred to in the art as peptides,
oligopeptides and oligomers, for example, and to longer
chains, which generally are referred to in the art as
proteins, of which there are many types. It will be
appreciated that polypeptides often contain amino acids
other than the 20 amino acids commonly referred to as the
20 naturally occurring amino acids, and that many amino
acids, including the terminal amino acids, may be modified
in a given polypeptide, either by natural processes such as
processing and other post-translational modifications, or
by chemical modification techniques which are well known to
the art. Even the common modifications that occur
naturally in polypeptides are too numerous to list
exhaustively here, but they are well described in basic
texts and in more detailed monographs, as well as in a
voluminous research literature, and they are well known to
those of skill in the art.

Modifications which may be present in polypeptides of
the present invention include, to name an illustrative few,
acetylation, acylation, ADP-ribosylation, amidation,
covalent attachment of flavin, covalent attachment of a
heme moiety, covalent attachment of a nucleotide or
nucleotide derivative, covalent attachment of a lipid or
lipid derivative, covalent attachment of
phosphatidylinositol, cross-linking, cyclization, disulfide
bond formation, demethylation, formation of covalent cross-
links, formation of cystine, formation of pyroglutamate,
formylation, gamma-carboxylation, glycosylation, GPI anchor
formation, hydroxylation, iodination, methylation,
myristoylation, oxidation, proteolytic processing,
phosphorylation, prenylation, racemization, selenoylation,
sulfation, transfer-RNA mediated addition of amino acids to
proteins such as arginylation, and ubiquitination.

Such modifications are well known to those of skill
and have been described in great detail in the scientific
literature. Several particularly common modifications
including, but not limited to, glycosylation, lipid
attachment, sulfation, gamma-carboxylation of glutamic acid
residues, hydroxylation and ADP-ribosylation are described
in most basic texts, such as, for instance, PROTEINS:
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E.
Creighton, W. H. Freeman and Company, New York (1993). Many
detailed reviews are available on this subject, such as,
for example, those provided by Wold, F., Posttranslational
Protein Modifications: Perspectives and Prospects, pgs. 1-
12 in POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS,
B. C. Johnson, Ed., Academic Press, New York (1983);
Seifter et al., Analysis for protein modifications and
nonprotein cofactors, Meth. Enzymol. 182: 626-646 (1990)
and Rattan et al., Protein Synthesis: Posttranslational
Modifications and Aging, Ann. N.Y. Acad. Sci. 663: 48-62
(1992).

It will be appreciated that the polypeptides of the
present invention are not always entirely linear. Instead,
polypeptides may be branched as a result of ubiquitination,
and they may be circular, with or without branching,
generally as a result of posttranslation events including
natural processing events and events brought about by human
manipulation which do not occur naturally. Circular,
branched and branched circular polypeptides may be
synthesized by non-translation natural processes and by
entirely synthetic methods, as well.

Modifications can occur anywhere in a polypeptide,
including the peptide backbone, the amino acid side-chains
and the amino or carboxyl termini. In fact, blockage of
the amino and/or carboxyl group in a polypeptide by a
covalent modification is common in naturally occurring and
synthetic polypeptides and such modifications may be
present in polypeptides of the present invention, as well.
For instance, the amino terminal residue of polypeptides
made in E. coli, prior to proteolytic processing, almost
invariably will be N-formylmethionine.

The modifications that occur in a polypeptide often
will be a function of how it is made. For polypeptides
made by expressing a cloned gene in a host, for instance,
the nature and extent of the modifications, in large part,
will be determined by the host cell posttranslational
modification capacity and the modification signals present
in the polypeptide amino acid sequence. For instance, as
is well known, glycosylation often does not occur in
bacterial hosts such as E. coli. Accordingly, when
glycosylation is desired, a polypeptide can be expressed in
a glycosylating host, generally a eukaryotic cell. Insect
cells often carry out the same posttranslational
glycosylations as mammalian cells. Thus, insect cell
expression systems have been developed to express
efficiently mammalian proteins having native patterns of
glycosylation, inter alia. Similar considerations apply to
other modifications.

It will be appreciated that the same type of
modification may be present in the same or varying degrees
at several sites in a given polypeptide. Also, a given
polypeptide may contain many types of modifications.

In general, as used herein, the term polypeptide
encompasses all such modifications, particularly those that
are present in polypeptides synthesized by expressing a
polynucleotide in a host cell.

VARIANT(S) of polynucleotides or polypeptides, as the
term is used herein, are polynucleotides or polypeptides
that differ from a reference polynucleotide or polypeptide,
respectively.

With respect to variant polynucleotides, differences 
are generally limited so that the nucleotide sequences of 
the reference and the variant are closely similar overall 
and, in many regions, identical. Thus, changes in the 
nucleotide sequence of the variant may be silent. That is, 
they may not alter the amino acids encoded by the 
polynucleotide. Where alterations are limited to silent
changes of this type, a variant will encode a polypeptide
with the same amino acid sequence as the reference.
Alternatively, changes in the nucleotide sequence of the 
variant may alter the amino acid sequence of a polypeptide 
encoded by the reference polynucleotide. Such nucleotide 
changes may result in amino acid substitutions, additions, 
deletions, fusions and truncations in the polypeptide 
encoded by the reference sequence. 
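
By way of illustration only, the distinction between silent changes and changes that alter the encoded amino acid sequence can be sketched in Python as follows. The function names and the short example sequences are hypothetical and are not taken from the sequence listing; the sketch assumes in-frame coding sequences and the standard genetic code.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def classify_variant(reference: str, variant: str) -> str:
    """Return 'identical', 'silent' or 'altering' for two coding sequences."""
    if reference.upper() == variant.upper():
        return "identical"
    if translate(reference) == translate(variant):
        return "silent"      # nucleotides differ, encoded polypeptide does not
    return "altering"        # encoded amino acid sequence differs


# Hypothetical example: GCT and GCC both encode Ala, whereas GAT encodes Asp.
print(classify_variant("ATGGCT", "ATGGCC"))  # silent
print(classify_variant("ATGGCT", "ATGGAT"))  # altering
```

A variant classified as "altering" may carry a substitution, addition, deletion, fusion or truncation; this sketch does not distinguish among them.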

With respect to variant polypeptides, differences are
generally limited so that the sequences of the reference
and the variant are closely similar overall and, in many
regions, identical. For example, a variant and reference
polypeptide may differ in amino acid sequence by one or
more substitutions, additions, deletions, fusions and
truncations, which may be present in any combination.

RECEPTOR MOLECULE, as used herein, refers to
molecules which bind or interact specifically with LSG
polypeptides of the present invention and is inclusive not
only of classic receptors, which are preferred, but also
other molecules that specifically bind to or interact with
polypeptides of the invention (which also may be referred
to as "binding molecules" and "interaction molecules,"
respectively, and as "LSG binding or interaction molecules").
Binding between polypeptides of the invention and such
molecules, including receptor or binding or interaction
molecules, may be exclusive to polypeptides of the
invention, which is very highly preferred, or it may be
highly specific for polypeptides of the invention, which is
highly preferred, or it may be highly specific to a group
of proteins that includes polypeptides of the invention,
which is preferred, or it may be specific to several groups
of proteins at least one of which includes polypeptides of
the invention.

Receptors also may be non-naturally occurring, such
as antibodies and antibody-derived reagents that bind to
polypeptides of the invention.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to novel lung specific 
polypeptides and polynucleotides, referred to herein as 
LSGs, among other things, as described in greater detail 
below. 
Polynucleotides

In accordance with one aspect of the present 
invention, there are provided isolated LSG polynucleotides 
which encode LSG polypeptides. 

Using the information provided herein, such as the
polynucleotide sequences set out in SEQ ID NO: 1, 2, 3, 4,
5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20,
or 74, a polynucleotide of the present invention encoding a
LSG may be obtained using standard cloning and screening
procedures, such as those for cloning cDNAs using mRNA from
cells of a human tumor as starting material.

Polynucleotides of the present invention may be in
the form of RNA, such as mRNA, or in the form of DNA,
including, for instance, cDNA and genomic DNA obtained by
cloning or produced by chemical synthetic techniques or by
a combination thereof. The DNA may be double-stranded or
single-stranded. Single-stranded DNA may be the coding
strand, also known as the sense strand, or it may be the
non-coding strand, also referred to as the anti-sense
strand.

The coding sequence which encodes the polypeptides
may be identical to the coding sequence of the
polynucleotides of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74. It also may
be a polynucleotide with a different sequence, which, as a
result of the redundancy (degeneracy) of the genetic code,
encodes the same polypeptides as encoded by SEQ ID NO: 1, 2,
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20 or 74.
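
The extent of this redundancy can be appreciated from a short calculation. The following Python sketch, offered as an illustration only, counts how many distinct coding sequences encode a given peptide under the standard genetic code; the function name and the example peptides are hypothetical.

```python
import math

# Number of codons that encode each amino acid in the standard genetic code
# (one-letter residue codes; the 61 sense codons are accounted for).
CODONS_PER_RESIDUE = {
    "L": 6, "S": 6, "R": 6,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "I": 3,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "M": 1, "W": 1,
}


def count_coding_sequences(peptide: str) -> int:
    """Count the distinct nucleotide sequences that encode the peptide."""
    return math.prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())


# Hypothetical examples: Met-Trp has a single coding sequence,
# whereas Leu-Ser-Arg has 6 * 6 * 6 = 216.
print(count_coding_sequences("MW"))
print(count_coding_sequences("LSR"))
```

Because the count multiplies across residues, even a short polypeptide is typically encoded by a very large number of polynucleotides that differ from the reference sequence.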

Polynucleotides of the present invention, such as SEQ
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19, 20 or 74, which encode these polypeptides
may comprise the coding sequence for the mature polypeptide
by itself. Polynucleotides of the present invention may
also comprise the coding sequence for the mature
polypeptide and additional coding sequences such as those
encoding a leader or secretory sequence such as a pre-, or
pro- or prepro-protein sequence. Polynucleotides of the
present invention may also comprise the coding sequence of
the mature polypeptide, with or without the aforementioned
additional coding sequences, together with additional,
non-coding sequences. Examples of additional non-coding
sequences which may be incorporated into the polynucleotide
of the present invention include, but are not limited to,
introns and non-coding 5' and 3' sequences such as
transcribed, non-translated sequences that play a role in
transcription, mRNA processing including, for example,
splicing and polyadenylation signals, ribosome binding and
stability of mRNA, and additional coding sequence which
codes for amino acids such as those which provide
additional functionalities. Thus, for instance, the
polypeptide may be fused to a marker sequence such as a
peptide which facilitates purification of the fused
polypeptide. In certain preferred embodiments of this
aspect of the invention, the marker sequence is a
hexa-histidine peptide, such as the tag provided in the pQE
vector (Qiagen, Inc.), among others, many of which are
commercially available. As described in Gentz et al. (Proc.
Natl. Acad. Sci., USA 86: 821-824 (1989)), for instance,
hexa-histidine provides for convenient purification of the
fusion protein. The HA tag corresponds to an epitope
derived from the influenza hemagglutinin protein (Wilson et
al., Cell 37: 767 (1984)).

In accordance with the foregoing, the term
"polynucleotide encoding a polypeptide" as used herein
encompasses polynucleotides which include a sequence
encoding a polypeptide of the present invention,
particularly SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20 or 74. Exemplary
polypeptides encoded by the polynucleotides are depicted in
SEQ ID NO: 75, 76, 77, 78, 79, 80, 81, 82, 83 and 84. The
term encompasses polynucleotides that include a single
continuous region or discontinuous regions encoding the
polypeptide (for example, interrupted by introns) together
with additional regions, that also may contain coding
and/or non-coding sequences.

The present invention further relates to variants of
the hereinabove described polynucleotides which encode
fragments, analogs and derivatives of the LSG polypeptides.
A variant of the polynucleotide may be a naturally
occurring variant such as a naturally occurring allelic
variant, or it may be a variant that is not known to occur
naturally. Such non-naturally occurring variants of the
polynucleotide may be made by mutagenesis techniques,
including those applied to polynucleotides, cells or
organisms.

Among variants in this regard are variants that
differ from the aforementioned polynucleotides by
nucleotide substitutions, deletions or additions. The
substitutions, deletions or additions may involve one or
more nucleotides. The variants may be altered in coding or
non-coding regions or both. Alterations in the coding
regions may produce conservative or non-conservative amino
acid substitutions, deletions or additions.

Among the particularly preferred embodiments of the
invention in this regard are polynucleotides encoding
polypeptides having the same amino acid sequence encoded by
a LSG polynucleotide comprising SEQ ID NO: 1, 2, 3, 4, 5,
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or
74; variants, analogs, derivatives and fragments thereof,
and fragments of the variants, analogs and derivatives.
Exemplary polypeptides encoded by these polynucleotides are
depicted in SEQ ID NO: 75, 76, 77, 78, 79, 80, 81, 82, 83
and 84. Further particularly preferred in this regard are
LSG polynucleotides encoding polypeptide variants, analogs,
derivatives and fragments, and variants, analogs and
derivatives of the fragments, in which several, a few, 5 to
10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are
substituted, deleted or added, in any combination.
Especially preferred among these are silent substitutions,
additions and deletions, which do not alter the properties
and activities of the LSG. Also especially preferred in
this regard are conservative substitutions. Most highly
preferred are polynucleotides encoding polypeptides having
the same amino acid sequences as polypeptides encoded by SEQ
ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19, 20 or 74, without substitutions.

Further preferred embodiments of the invention are
LSG polynucleotides that are at least 70% identical to a
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74 and
polynucleotides which are complementary to such
polynucleotides. More preferred are LSG polynucleotides
that comprise a region that is at least 80% identical to a
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74. In this
regard, LSG polynucleotides at least 90% identical to the
same are particularly preferred, and among these
particularly preferred LSG polynucleotides, those with at
least 95% identity are especially preferred. Furthermore,
those with at least 97% identity are highly preferred among
those with at least 95%, and among these those with at
least 98% and at least 99% identity are particularly highly
preferred, with at least 99% being the most preferred.
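
For illustration only, the identity thresholds recited above can be applied to a pair of sequences as sketched below in Python. The sketch assumes the two sequences have already been aligned to equal length (with "-" marking gaps) and that identity is computed over all aligned columns; this text does not prescribe a particular alignment algorithm, and the function names and example sequences are hypothetical.

```python
# Identity thresholds recited in the text, from most to least preferred.
THRESHOLDS = (99, 98, 97, 95, 90, 80, 70)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns that match; gap columns never match."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity: float):
    """Return the highest recited threshold satisfied, or None if below 70%."""
    for threshold in THRESHOLDS:
        if identity >= threshold:
            return threshold
    return None


# Hypothetical aligned sequences differing at one of ten positions.
identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
print(identity, highest_threshold_met(identity))
```

Other conventions for computing identity (for example, excluding gap columns from the denominator) would give different values for gapped alignments.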

Particularly preferred embodiments in this respect,
moreover, are polynucleotides which encode polypeptides
which retain substantially the same biological function or
activity as the mature polypeptides encoded by a
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74.

The present invention further relates to
polynucleotides that hybridize to the hereinabove-described
LSG sequences. In this regard, the present
invention especially relates to polynucleotides which
hybridize under stringent conditions to the
hereinabove-described polynucleotides. As herein used, the term
"stringent conditions" means hybridization will occur only
if there is at least 95% and preferably at least 97%
identity between the sequences.

As discussed additionally herein regarding
polynucleotide assays of the invention, for instance,
polynucleotides of the invention as described herein may
be used as a hybridization probe for cDNA and genomic DNA
to isolate full-length cDNAs and genomic clones encoding
LSGs and to isolate cDNA and genomic clones of other genes
that have a high sequence similarity to these LSGs. Such
probes generally will comprise at least 15 bases.
Preferably, such probes will have at least 30 bases and may
have at least 50 bases.

For example, the coding region of a LSG of the present
invention may be isolated by screening using an
oligonucleotide probe synthesized from the known DNA
sequence. A labeled oligonucleotide having a sequence
complementary to that of a gene of the present invention is
used to screen a library of human cDNA, genomic DNA or mRNA
to determine which members of the library the probe
hybridizes with.
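
As a minimal sketch of how such a complementary probe sequence might be derived from a known DNA sequence, the following Python example returns the reverse complement of a chosen region. The function names and the example target are hypothetical; the 15-base minimum and 30-base default reflect the probe lengths mentioned above, and no account is taken of melting temperature, labeling or secondary structure.

```python
# Translation table for Watson-Crick complements of DNA bases.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return dna.translate(COMPLEMENT)[::-1]


def design_probe(target: str, start: int, length: int = 30) -> str:
    """Return an oligonucleotide complementary to target[start:start + length]."""
    if length < 15:
        raise ValueError("probes generally comprise at least 15 bases")
    region = target[start:start + length]
    if len(region) < length:
        raise ValueError("target is too short for the requested probe")
    return reverse_complement(region)


# Hypothetical 20-base target; a 15-base probe against its first 15 bases.
target = "ATGCATGCATGCATGCATGC"
probe = design_probe(target, start=0, length=15)
print(probe)
```

The probe is written 5' to 3', so it anneals antiparallel to the selected region of the target strand.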

The polynucleotides and polypeptides of the present
invention may be employed as research reagents and
materials for discovery of treatments and diagnostics for
human disease, as further discussed herein relating to
polynucleotide assays, inter alia.

The polynucleotides may encode a polypeptide which is
the mature protein plus additional amino- or
carboxyl-terminal amino acids, or amino acids interior to the mature
polypeptide (when the mature form has more than one
polypeptide chain, for instance). Such sequences may play a
role in processing of a protein from precursor to a mature
form, may facilitate protein trafficking, may prolong or
shorten protein half-life or may facilitate manipulation of
a protein for assay or production, among other things. As
generally is the case in situ, the additional amino acids
may be processed away from the mature protein by cellular
enzymes.

A precursor protein having the mature form of the
polypeptide fused to one or more prosequences may be an
inactive form of the polypeptide. When prosequences are
removed, such inactive precursors generally are activated.
Some or all of the prosequences may be removed before
activation. Generally, such precursors are called
proproteins.

In sum, a polynucleotide of the present invention may
encode a mature protein, a mature protein plus a leader
sequence (which may be referred to as a preprotein), a
precursor of a mature protein having one or more
prosequences which are not the leader sequences of a
preprotein, or a preproprotein, which is a precursor to a
proprotein, having a leader sequence and one or more
prosequences, which generally are removed during processing
steps that produce active and mature forms of the
polypeptide.
Polypeptides 

The present invention further relates to LSG
polypeptides, preferably polypeptides encoded by a
polynucleotide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10,
11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74. Exemplary
polypeptides are depicted in SEQ ID NO: 75, 76, 77, 78, 79,
80, 81, 82, 83 or 84. The invention also relates to
fragments, analogs and derivatives of these polypeptides.
The terms "fragment," "derivative" and "analog" when
referring to the polypeptides of the present invention
mean a polypeptide which retains essentially the same
biological function or activity as such polypeptides.
Thus, an analog includes a proprotein which can be
activated by cleavage of the proprotein portion to produce
an active mature polypeptide.

The polypeptide of the present invention may be a
recombinant polypeptide, a natural polypeptide or a
synthetic polypeptide. In certain preferred embodiments it
is a recombinant polypeptide.

The fragment, derivative or analog of a polypeptide
of the present invention may be (i) one in which one or
more of the amino acid residues are substituted with a
conserved or non-conserved amino acid residue (preferably a
conserved amino acid residue) and such substituted amino
acid residue may or may not be one encoded by the genetic
code; (ii) one in which one or more of the amino acid
residues includes a substituent group; (iii) one in which
the mature polypeptide is fused with another compound, such
as a compound to increase the half-life of the polypeptide
(for example, polyethylene glycol); or (iv) one in which
the additional amino acids are fused to the mature
polypeptide, such as a leader or secretory sequence or a
sequence which is employed for purification of the mature
polypeptide or a proprotein sequence. Such fragments,
derivatives and analogs are deemed to be within the scope
of those skilled in the art from the teachings herein.

Among preferred variants are those that vary from a
reference by conservative amino acid substitutions. Such
substitutions are those that substitute a given amino acid
in a polypeptide by another amino acid of like
characteristics. Typically seen as conservative
substitutions are the replacements, one for another, among
the aliphatic amino acids Ala, Val, Leu and Ile;
interchange of the hydroxyl residues Ser and Thr; exchange
of the acidic residues Asp and Glu; substitution between
the amide residues Asn and Gln; exchange of the basic
residues Lys and Arg; and replacements among the aromatic
residues Phe and Tyr.
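
The groupings listed above lend themselves to a simple lookup. The following Python sketch is an illustration only; it encodes exactly the six groups named in the preceding paragraph, and the function name is hypothetical.

```python
# Conservative substitution groups as listed in the text
# (three-letter amino acid codes).
CONSERVATIVE_GROUPS = (
    frozenset({"Ala", "Val", "Leu", "Ile"}),  # aliphatic
    frozenset({"Ser", "Thr"}),                # hydroxyl
    frozenset({"Asp", "Glu"}),                # acidic
    frozenset({"Asn", "Gln"}),                # amide
    frozenset({"Lys", "Arg"}),                # basic
    frozenset({"Phe", "Tyr"}),                # aromatic
)


def is_conservative(original: str, replacement: str) -> bool:
    """True if replacing `original` with `replacement` stays within one group."""
    if original == replacement:
        return False  # no substitution has occurred
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS
    )


print(is_conservative("Leu", "Ile"))  # True: both aliphatic
print(is_conservative("Asp", "Lys"))  # False: acidic replaced by basic
```

Residues not named in the text (for example Gly, Pro or Trp) fall in no group here, so any substitution involving them is reported as non-conservative by this sketch.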

The polypeptides and polynucleotides of the present 
invention are preferably provided in an isolated form, and 
preferably are purified to homogeneity. 

The polypeptides of the present invention include the
polypeptides encoded by the polynucleotide of SEQ ID NO: 1,
2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20 or 74 (in particular the mature polypeptide) as well
as polypeptides which have at least 75% similarity
(preferably at least 75% identity), more preferably at
least 90% similarity (more preferably at least 90%
identity), still more preferably at least 95% similarity
(still more preferably at least 95% identity), to a
polypeptide encoded by SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8,
9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74. Also
included are portions of such polypeptides generally
containing at least 30 amino acids and more preferably at
least 50 amino acids. Exemplary polypeptides are depicted
in SEQ ID NO: 75, 76, 77, 78, 79, 80, 81, 82, 83 or 84.

As known in the art, "similarity" between two
polypeptides is determined by comparing the amino acid
sequence and its conserved amino acid substitutes of one
polypeptide sequence with that of a second polypeptide.

Fragments or portions of the polypeptides of the
present invention may be employed for producing the
corresponding full-length polypeptide by peptide synthesis;
therefore, the fragments may be employed as intermediates
for producing the full-length polypeptides. Fragments or
portions of the polynucleotides of the present invention
may be used to synthesize full-length polynucleotides of
the present invention.
Fragments 

Also among preferred embodiments of this aspect of
the present invention are polypeptides comprising fragments
of a polypeptide encoded by a polynucleotide of SEQ ID NO:
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,
18, 19, 20 or 74. In this regard a fragment is a
polypeptide having an amino acid sequence that entirely is
the same as part but not all of the amino acid sequence of
the aforementioned LSG polypeptides and variants or
derivatives thereof.

Such fragments may be "free-standing," i.e., not part
of or fused to other amino acids or polypeptides, or they
may be contained within a larger polypeptide of which they
form a part or region. When contained within a larger
polypeptide, the presently discussed fragments most
preferably form a single continuous region. However,
several fragments may be comprised within a single larger
polypeptide. For instance, certain preferred embodiments
relate to a fragment of a LSG polypeptide of the present
invention comprised within a precursor polypeptide designed for
expression in a host and having heterologous pre- and
pro-polypeptide regions fused to the amino terminus of the LSG
fragment and an additional region fused to the carboxyl
terminus of the fragment. Therefore, fragments, in one
aspect of the meaning intended herein, refer to the
portion or portions of a fusion polypeptide or fusion
protein derived from a LSG polypeptide.

As representative examples of polypeptide fragments
of the invention, there may be mentioned those which have
from about 15 to about 139 amino acids. In this context
"about" includes the particularly recited range and ranges
larger or smaller by several, a few, 5, 4, 3, 2 or 1 amino
acid at either extreme or at both extremes. Highly
preferred in this regard are the recited ranges plus or
minus as many as 5 amino acids at either or at both
extremes. Particularly highly preferred are the recited
ranges plus or minus as many as 3 amino acids at either or
at both the recited extremes. Especially preferred are
ranges plus or minus 1 amino acid at either or at both
extremes or the recited ranges with no additions or
deletions. Most highly preferred of all in this regard are
fragments from about 15 to about 45 amino acids.

Among especially preferred fragments of the invention
are truncation mutants of the LSG polypeptides. Truncation
mutants include LSG polypeptides having an amino acid
sequence encoded by a polynucleotide of SEQ ID NO: 1, 2, 3,
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19,
20 or 74 or variants or derivatives thereof, except for
deletion of a continuous series of residues (that is, a
continuous region, part or portion) that includes the amino
terminus, or a continuous series of residues that includes
the carboxyl terminus or, as in double truncation mutants,
deletion of two continuous series of residues, one
including the amino terminus and one including the carboxyl
terminus. Fragments having the size ranges set out herein
also are preferred embodiments of truncation fragments,
which are especially preferred among fragments generally.
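
By way of a hedged illustration, the three kinds of truncation mutant described above (amino-terminal, carboxyl-terminal and double truncation) can be expressed as a single slicing operation on a polypeptide sequence. The function name and the ten-residue example sequence below are hypothetical and do not correspond to any sequence of the sequence listing.

```python
def truncate(polypeptide: str, n_terminal: int = 0, c_terminal: int = 0) -> str:
    """Delete a continuous run of residues from either or both termini.

    n_terminal: number of residues removed starting at the amino terminus.
    c_terminal: number of residues removed ending at the carboxyl terminus.
    """
    if n_terminal < 0 or c_terminal < 0:
        raise ValueError("deletion lengths must be non-negative")
    if n_terminal == 0 and c_terminal == 0:
        raise ValueError("a truncation mutant must delete at least one residue")
    if n_terminal + c_terminal >= len(polypeptide):
        raise ValueError("truncation must leave part of the sequence")
    return polypeptide[n_terminal:len(polypeptide) - c_terminal]


# Hypothetical ten-residue sequence in one-letter code.
sequence = "MKTAYIAKQR"
print(truncate(sequence, n_terminal=2))                # amino-terminal truncation
print(truncate(sequence, c_terminal=3))                # carboxyl-terminal truncation
print(truncate(sequence, n_terminal=2, c_terminal=3))  # double truncation
```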

Also preferred in this aspect of the invention are
fragments characterized by structural or functional
attributes of the LSG polypeptides of the present
invention. Preferred embodiments of the invention in this
regard include fragments that comprise alpha-helix and
alpha-helix forming regions ("alpha-regions"), beta-sheet
and beta-sheet-forming regions ("beta-regions"), turn and
turn-forming regions ("turn-regions"), coil and
coil-forming regions ("coil-regions"), hydrophilic regions,
hydrophobic regions, alpha amphipathic regions, beta
amphipathic regions, flexible regions, surface-forming
regions and high antigenic index regions of the LSG
polypeptides of the present invention. Regions of the
aforementioned types are identified routinely by analysis
of the amino acid sequences encoded by the polynucleotides
of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,
14, 15, 16, 17, 18, 19, 20 or 74. Preferred regions
include Garnier-Robson alpha-regions, beta-regions,
turn-regions and coil-regions, Chou-Fasman alpha-regions,
beta-regions and turn-regions, Kyte-Doolittle hydrophilic
regions and hydrophobic regions, Eisenberg alpha and beta
amphipathic regions, Karplus-Schulz flexible regions, Emini
surface-forming regions and Jameson-Wolf high antigenic
index regions. Among highly preferred fragments in this
regard are those that comprise regions of LSGs that combine
several structural features, such as several of the
features set out above. In this regard, the regions
defined by selected residues of a LSG polypeptide which all
are characterized by amino acid compositions highly
characteristic of turn-regions, hydrophilic regions,
flexible-regions, surface-forming regions, and high
antigenic index-regions, are especially highly preferred
regions. Such regions may be comprised within a larger
polypeptide or may be by themselves a preferred fragment of
the present invention, as discussed above. It will be
appreciated that the term "about" as used in this paragraph
has the meaning set out above regarding fragments in
general.
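
As one example of the routine sequence analysis referred to above, a Kyte-Doolittle hydropathy profile can be computed with a sliding window. The Python sketch below is illustrative only: it uses the published Kyte-Doolittle hydropathy values, but the window size of 9, the cutoff of zero and the function names are assumptions of this sketch rather than parameters specified herein.

```python
# Kyte-Doolittle hydropathy values (one-letter amino acid codes).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list:
    """Mean hydropathy of each window of `window` consecutive residues."""
    protein = protein.upper()
    if window < 1 or window > len(protein):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in protein]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophilic_window_starts(protein: str, window: int = 9, cutoff: float = 0.0) -> list:
    """Start positions (0-based) of windows whose mean hydropathy is below cutoff."""
    profile = hydropathy_profile(protein, window)
    return [i for i, score in enumerate(profile) if score < cutoff]


# Hypothetical sequence: an Arg-rich (hydrophilic) stretch followed by
# an Ile-rich (hydrophobic) stretch.
print(hydrophilic_window_starts("RRRRRIIIII", window=5))
```

Windows with negative mean hydropathy are treated here as hydrophilic and the remainder as hydrophobic; the other analyses named above (Garnier-Robson, Chou-Fasman, Eisenberg, Karplus-Schulz, Emini, Jameson-Wolf) use different scales and rules and are not sketched here.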

Further preferred regions are those that mediate
activities of LSG polypeptides. Most highly preferred in
this regard are fragments that have a chemical, biological
or other activity of a LSG polypeptide, including those
with a similar activity or an improved activity, or with a
decreased undesirable activity. Highly preferred in this
regard are fragments that contain regions that are homologs
in sequence, or in position, or in both sequence and
position, to active regions of related polypeptides, and
which include lung specific-binding proteins. Among particularly
preferred fragments in these regards are truncation
mutants, as discussed above.

It will be appreciated that the invention also
relates to polynucleotides encoding the aforementioned
fragments, polynucleotides that hybridize to
polynucleotides encoding the fragments, particularly those
that hybridize under stringent conditions, and
polynucleotides such as PCR primers for amplifying
polynucleotides that encode the fragments. In these
regards, preferred polynucleotides are those that
correspond to the preferred fragments, as discussed above.
Fusion Proteins 

In one embodiment of the present invention, the LSG
polypeptides of the present invention are preferably fused
to other proteins. These fusion proteins can be used for a
variety of applications. For example, fusion of the present
polypeptides to His-tag, HA-tag, protein A, IgG domains,
and maltose binding protein facilitates purification. (See
also EP A 394,827; Traunecker, et al., Nature 331: 84-86
(1988).) Similarly, fusion to IgG-1, IgG-3, and albumin
increases the half-life in vivo. Nuclear localization
signals fused to the polypeptides of the present invention
can target the protein to a specific subcellular
localization, while covalent heterodimers or homodimers can
increase or decrease the activity of a fusion protein.
Fusion proteins can also create chimeric molecules having
more than one function. Finally, fusion proteins can
increase solubility and/or stability of the fused protein
compared to the non-fused protein. All of these types of
fusion proteins described above can be made in accordance
with well known protocols.

For example, a LSG polypeptide can be fused to an IgG
molecule via the following protocol. Briefly, the human Fc
portion of the IgG molecule is PCR amplified using primers
that span the 5' and 3' ends of the sequence. These
primers also have convenient restriction enzyme sites that
facilitate cloning into an expression vector, preferably a
mammalian expression vector. For example, if pC4
(Accession No. 209646) is used, the human Fc portion can be
ligated into the BamHI cloning site. In this protocol, the
3' BamHI site must be destroyed. Next, the vector
containing the human Fc portion is re-restricted with BamHI
thereby linearizing the vector, and a LSG polynucleotide of
the present invention is ligated into this BamHI site. It
is preferred that the polynucleotide is cloned without a
stop codon, otherwise a fusion protein will not be
produced.
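
The stop codon requirement can be checked computationally before cloning. The following Python sketch is offered as an illustration only, on the assumption that the insert is an in-frame coding sequence placed upstream of the Fc coding sequence; the function name and the example sequences are hypothetical, and restriction sites, linkers and reading-frame junctions are not modeled.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def prepare_fusion_insert(coding_sequence: str) -> str:
    """Return the coding sequence with any terminal stop codon removed.

    Raises ValueError if the sequence is not in frame or if an internal
    in-frame stop codon would terminate translation before the fusion partner.
    """
    cds = coding_sequence.upper()
    if len(cds) % 3 != 0 or not cds:
        raise ValueError("coding sequence length must be a non-zero multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    if codons[-1] in STOP_CODONS:
        codons = codons[:-1]  # drop the terminal stop so translation reads through
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("internal in-frame stop codon found")
    return "".join(codons)


# Hypothetical insert: Met-Ala followed by a TAA stop codon.
print(prepare_fusion_insert("ATGGCTTAA"))
```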

If the naturally occurring signal sequence is used to
produce the secreted protein, pC4 does not need a second
signal peptide. Alternatively, if the naturally occurring
signal sequence is not used, the vector can be modified to
include a heterologous signal sequence. (See, e.g., WO
96/34891.)
Diagnostic Assays 
The present invention also relates to diagnostic
assays and methods, both quantitative and qualitative, for
detecting, diagnosing, monitoring, staging and
prognosticating cancers by comparing levels of LSG in a
human patient with those of LSG in a normal human control.

For purposes of the present invention, what is meant by LSG
levels is, among other things, native protein expressed by
a gene comprising the polynucleotide sequence of SEQ ID NO:
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17,
18, 19, 20 or 74. Exemplary polypeptides encoded by these
polynucleotides are depicted in SEQ ID NO: 75, 76, 77, 78,
79, 80, 81, 82, 83 and 84. By "LSG" it is also meant
herein polynucleotides which, due to degeneracy in genetic
coding, comprise variations in nucleotide sequence as
compared to SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19, 20 or 74 but which still
encode the same protein. The native protein being detected
may be whole, a breakdown product, a complex of molecules
or chemically modified. In the alternative, what is meant
by LSG as used herein is the native mRNA encoded by a
polynucleotide sequence of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7,
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 74,
levels of the gene comprising the polynucleotide sequence
of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,
14, 15, 16, 17, 18, 19, 20 or 74, or levels of a
polynucleotide which is capable of hybridizing under
stringent conditions to the antisense sequence of SEQ ID
NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19, 20 or 74. Such levels are preferably
determined in at least one of cells, tissues and/or bodily
fluids, including determination of normal and abnormal
levels. Thus, for instance, a diagnostic assay in
accordance with the invention for diagnosing overexpression
of LSG protein compared to normal control bodily fluids,
cells, or tissue samples may be used to diagnose the
presence of lung cancer.

All the methods of the present invention may
optionally include determining the levels of other cancer
markers as well as LSG. Other cancer markers, in addition
to LSG, useful in the present invention will depend on the
cancer being tested and are known to those of skill in the
art.

The present invention provides methods for diagnosing
the presence of lung cancer by analyzing for changes in
levels of LSG in cells, tissues or bodily fluids compared
with levels of LSG in cells, tissues or bodily fluids of
preferably the same type from a normal human control,
wherein an increase in levels of LSG in the patient versus
the normal human control is associated with the presence of
lung cancer.

Without limiting the instant invention, typically,
for a quantitative diagnostic assay a positive result
indicating the patient being tested has cancer is one in
which cells, tissues or bodily fluid levels of the cancer
marker, such as LSG, are at least two times higher, and
most preferably are at least five times higher, than in
preferably the same cells, tissues or bodily fluid of a
normal human control.
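
The two-fold and five-fold criteria above amount to a ratio test, which can be sketched as follows. This Python example is a non-limiting illustration: the function names, the result labels and the numeric levels are hypothetical, and the sketch assumes that patient and control levels are measured in the same units in the same sample type.

```python
def fold_change(patient_level: float, control_level: float) -> float:
    """Ratio of the patient marker level to the normal control level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return patient_level / control_level


def interpret(patient_level: float, control_level: float,
              positive_fold: float = 2.0, preferred_fold: float = 5.0) -> str:
    """Classify a quantitative result against the two- and five-fold criteria."""
    ratio = fold_change(patient_level, control_level)
    if ratio >= preferred_fold:
        return "positive (at least five-fold)"
    if ratio >= positive_fold:
        return "positive (at least two-fold)"
    return "not positive"


# Hypothetical marker levels in arbitrary units.
print(interpret(10.0, 2.0))  # ratio 5.0
print(interpret(5.0, 2.0))   # ratio 2.5
print(interpret(3.0, 2.0))   # ratio 1.5
```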

The present invention also provides a method of
diagnosing metastatic lung cancer in a patient having lung
cancer which is not yet known to have metastasized. In the
method of the present invention, a
human cancer patient suspected of having lung cancer which
may have metastasized (but which was not previously known
to have metastasized) is identified. This is accomplished
by a variety of means known to those of skill in the art.

In the present invention, determining
LSG levels in cells, tissues or bodily fluid is
particularly useful for discriminating between lung cancer
which has not metastasized and lung cancer which has
metastasized. Existing techniques have difficulty
discriminating between lung cancer which has metastasized
and lung cancer which has not metastasized, and proper
treatment selection is often dependent upon such knowledge.
In the present invention, the cancer marker levels
measured in such cells, tissues or bodily fluid are those
of LSG, and
are compared with levels of LSG in preferably the same
cells, tissue or bodily fluid type of a normal human
control. That is, if the cancer marker being observed is
just LSG in serum, this level is preferably compared with
the level of LSG in serum of a normal human control. An
increase in the LSG in the patient versus the normal human
control is associated with lung cancer which has
metastasized.

Without limiting the instant invention, typically,
for a quantitative diagnostic assay a positive result
indicating the cancer in the patient being tested or
monitored has metastasized is one in which cells, tissues
or bodily fluid levels of the cancer marker, such as LSG,
are at least two times higher, and most preferably are at
least five times higher, than in preferably the same cells,
tissues or bodily fluid of a normal patient.

Normal human control as used herein includes a human 
patient without cancer and/or non cancerous samples from 
the patient; in the methods for diagnosing or monitoring
for metastasis, normal human control may preferably also
include samples from a human patient that is determined by
reliable methods to have lung cancer which has not
metastasized.

Staging

The invention also provides a method of staging lung 
cancer in a human patient. The method comprises 
identifying a human patient having such cancer and 
analyzing cells, tissues or bodily fluid from such human
patient for LSG. The LSG levels determined in the patient
are then compared with levels of LSG in preferably the same
cells, tissues or bodily fluid type of a normal human
control, wherein an increase in LSG levels in the human
patient versus the normal human control is associated with
a cancer which is progressing and a decrease in the levels 
of LSG (but still increased over true normal levels) is 
associated with a cancer which is regressing or in 
remission. 

Monitoring

Further provided is a method of monitoring lung
cancer in a human patient having such cancer for the onset
of metastasis. The method comprises identifying a human
patient having such cancer that is not known to have
metastasized; periodically analyzing cells, tissues or
bodily fluid from such human patient for LSG; and comparing
the LSG levels determined in the human patient with levels
of LSG in preferably the same cells, tissues or bodily
fluid type of a normal human control, wherein an increase
in LSG levels in the human patient versus the normal human
control is associated with a cancer which has metastasized. 
In this method, normal human control samples may also 
include prior patient samples. 

Further provided by this invention is a method of
monitoring the change in stage of lung cancer in a human
patient having such cancer. The method comprises
identifying a human patient having such cancer;
periodically analyzing cells, tissues or bodily fluid from
such human patient for LSG; and comparing the LSG levels
determined in the human patient with levels of LSG in
preferably the same cells, tissues or bodily fluid type of
a normal human control, wherein an increase in LSG levels
in the human patient versus the normal human control is
associated with a cancer which is progressing in stage and
a decrease in the levels of LSG is associated with a cancer
which is regressing in stage or in remission. In this
method, normal human control samples may also include prior
patient samples.
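
As a non-limiting sketch of the comparison against prior patient samples, the periodic measurements can be treated as an ordered series in which each value is compared with the one before it. The function name, the labels and the use of a plain list of levels are assumptions made for illustration; a practical assay would also need to account for measurement variability, which is not modeled here.

```python
def stage_trend(levels):
    """Compare each periodic LSG measurement with the prior patient sample.

    `levels` is a chronologically ordered list of LSG levels measured in the
    same sample type. One label is returned per consecutive pair of samples.
    """
    trends = []
    for previous, current in zip(levels, levels[1:]):
        if current > previous:
            trends.append("progressing")
        elif current < previous:
            trends.append("regressing or in remission")
        else:
            trends.append("unchanged")
    return trends
```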

Monitoring a patient for onset of metastasis is 
periodic and preferably done on a quarterly basis.
However, this may be done more or less frequently depending
on the cancer, the particular patient, and the stage of the 
cancer. 

Prognostic Testing and Clinical Trial Monitoring 

The methods described herein can further be utilized
as prognostic assays to identify subjects having or at risk
of developing a disease or disorder associated with
increased levels of LSG. The present invention provides a
method in which a test sample is obtained from a human
patient and LSG is detected. The presence of higher LSG
levels as compared to normal human controls is diagnostic 
for the human patient being at risk for developing cancer, 
particularly lung cancer. 

The effectiveness of therapeutic agents to decrease
expression or activity of the LSGs of the invention can
also be monitored by analyzing levels of expression of the
LSGs in a human patient in clinical trials or in in vitro
screening assays such as in human cells. In this way, the
gene expression pattern can serve as a marker, indicative
of the physiological response of the human patient, or
cells as the case may be, to the agent being tested.

Detection of genetic lesions or mutations

The methods of the present invention can also be used
to detect genetic lesions or mutations in LSG, thereby
determining if a human with the genetic lesion is at risk
for lung cancer or has lung cancer. Genetic lesions can be
detected, for example, by ascertaining the existence of a
deletion and/or addition and/or substitution of one or more
nucleotides from the LSGs of this invention, a chromosomal
rearrangement of LSG, aberrant modification of LSG (such as
of the methylation pattern of the genomic DNA), the
presence of a non-wild type splicing pattern of an mRNA
transcript of LSG, allelic loss of LSG, and/or
inappropriate post-translational modification of LSG
protein. Methods to detect such lesions in the LSG of this
invention are known to those of skill in the art.

For example, in one embodiment, alterations in a gene
corresponding to a LSG polynucleotide of the present
invention are determined by isolating RNA from entire
families or individual patients presenting with a phenotype
of interest (such as a disease). cDNA is
then generated from these RNA samples using protocols known
in the art. See, e.g., Sambrook et al. (MOLECULAR CLONING: A
LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y. (1989)), which is
illustrative of the many laboratory manuals that detail
these techniques. The cDNA is then used as a template for
PCR, employing primers surrounding regions of interest in
SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, 18, 19, 20 or 74. PCR conditions typically
consist of 35 cycles at 95°C for 30 seconds; 60-120 seconds
at 52-58°C; and 60-120 seconds at 70°C, using buffer
solutions described in Sidransky, D., et al., Science 252:
706 (1991). PCR products are sequenced using primers
labeled at their 5' end with T4 polynucleotide kinase,
employing SequiTherm Polymerase (Epicentre Technologies).
The intron-exon borders of selected exons are also
determined and genomic PCR products analyzed to confirm the
results. PCR products harboring suspected mutations are
then cloned and sequenced to validate the results of the
direct sequencing. PCR products are cloned into T-tailed
vectors as described in Holton, T. A. and Graham, M. W.,
Nucleic Acids Research, 19: 1156 (1991) and sequenced with
T7 polymerase (United States Biochemical). Affected
individuals are identified by mutations not present in
unaffected individuals.
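
The final comparison step, in which mutations found in affected individuals are retained only if absent from unaffected individuals, can be sketched as a set difference. This minimal sketch assumes that the sequenced PCR products have already been aligned to a reference of equal length and considers substitutions only; the function names are hypothetical, and insertions, deletions and sequencing error are not handled.

```python
def substitutions(reference, sample):
    """Return {(position, reference_base, sample_base)} for aligned sequences.

    Positions are 1-based. Both sequences must have the same length.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return {
        (index, ref_base, sample_base)
        for index, (ref_base, sample_base)
        in enumerate(zip(reference.upper(), sample.upper()), start=1)
        if ref_base != sample_base
    }


def candidate_mutations(reference, affected, unaffected):
    """Return substitutions seen in affected but not in unaffected samples."""
    seen_in_affected = set().union(
        *(substitutions(reference, seq) for seq in affected))
    seen_in_unaffected = set().union(
        *(substitutions(reference, seq) for seq in unaffected))
    return seen_in_affected - seen_in_unaffected
```

A substitution shared by affected and unaffected individuals is treated here as a polymorphism and is discarded.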

Genomic rearrangements can also be observed as a
method of determining alterations in a gene corresponding
to a polynucleotide. In this method, genomic clones are
nick-translated with digoxigenin-deoxyuridine
5'-triphosphate (Boehringer Mannheim), and FISH is performed
as described in Johnson, C. et al., Methods Cell Biol. 35:
73-99 (1991). Hybridization with a labeled probe is carried
out using a vast excess of human DNA for specific
hybridization to the corresponding genomic locus.
Chromosomes are counterstained with
4',6-diamidino-2-phenylindole and propidium iodide,
producing a combination
of C- and R-bands. Aligned images for precise mapping are
obtained using a triple-band filter set (Chroma Technology,
Brattleboro, VT) in combination with a cooled
charge-coupled device camera (Photometrics, Tucson, AZ) and
variable excitation wavelength filters (Johnson et al.,
Genet. Anal. Tech. Appl., 8: 75 (1991)). Image collection,
analysis and chromosomal fractional length measurements are
performed using the ISee Graphical Program System
(Inovision Corporation, Durham, NC). Chromosome
alterations of the genomic region hybridized by the probe
are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic
marker for an associated disease.

Assay Techniques

Assay techniques that can be used to determine levels
of gene expression (including protein levels), such as LSG
of the present invention, in a sample derived from a
patient are well known to those of skill in the art. Such
assay methods include, without limitation,
radioimmunoassays, reverse transcriptase PCR (RT-PCR)
assays, immunohistochemistry assays, in situ hybridization
assays, competitive-binding assays, Western Blot analyses,
ELISA assays and proteomic approaches: two-dimensional gel
electrophoresis (2D electrophoresis) and non-gel based
approaches such as mass spectrometry or protein interaction
profiling. Among these, ELISAs are frequently preferred to
detect a gene's expressed protein in biological fluids.

An ELISA assay initially comprises preparing an
antibody, if not readily available from a commercial
source, specific to LSG, preferably a monoclonal antibody.
In addition a reporter antibody generally is prepared which
binds specifically to LSG. The reporter antibody is
attached to a detectable reagent such as a radioactive,
fluorescent or enzymatic reagent, for example horseradish
peroxidase enzyme or alkaline phosphatase.

To carry out the ELISA, antibody specific to LSG is
incubated on a solid support, e.g. a polystyrene dish, that
binds the antibody. Any free protein binding sites on the
dish are then covered by incubating with a non-specific
protein such as bovine serum albumin. Next, the sample to
be analyzed is incubated in the dish, during which time LSG
binds to the specific antibody attached to the polystyrene
dish. Unbound sample is washed out with buffer. A reporter
antibody specifically directed to LSG and linked to a
detectable reagent such as horseradish peroxidase is placed
in the dish resulting in binding of the reporter antibody
to any LSG bound to the monoclonal antibody. Unattached
reporter antibody is then washed out. Reagents for
peroxidase activity, including a colorimetric substrate, are
then added to the dish. Immobilized peroxidase, linked to
LSG antibodies, produces a colored reaction product. The
amount of color developed in a given time period is
proportional to the amount of LSG protein present in the
sample. Quantitative results typically are obtained by
reference to a standard curve.
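
Reading a concentration from a standard curve can be illustrated as follows. This is a minimal sketch that assumes a monotonically increasing curve defined by a few (concentration, signal) standards and uses straight-line interpolation between neighboring standards; the function name and the example units are hypothetical, and in practice a fitted nonlinear curve is often used instead.

```python
def concentration_from_standard_curve(signal, standards):
    """Interpolate a concentration from (concentration, signal) standards.

    `standards` is a list of (concentration, signal) pairs whose signal
    increases with concentration. A ValueError is raised when the measured
    signal lies outside the range covered by the standards.
    """
    points = sorted(standards, key=lambda point: point[1])
    for (conc_lo, sig_lo), (conc_hi, sig_hi) in zip(points, points[1:]):
        if sig_lo <= signal <= sig_hi:
            if sig_hi == sig_lo:
                return conc_lo
            fraction = (signal - sig_lo) / (sig_hi - sig_lo)
            return conc_lo + fraction * (conc_hi - conc_lo)
    raise ValueError("signal falls outside the range of the standard curve")
```

Signals outside the curve are rejected rather than extrapolated, since the proportionality described above is only established within the range of the standards.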

A competition assay can also be employed wherein 
antibodies specific to LSG are attached to a solid support 
and labeled LSG and a sample derived from the host are
passed over the solid support. The amount of label 
detected which is attached to the solid support can be 
correlated to a quantity of LSG in the sample. 

Using all or a portion of a nucleic acid sequence of
LSG of the present invention as a hybridization probe,
nucleic acid methods can also be used to detect LSG mRNA as
a marker for lung cancer. Polymerase chain reaction (PCR)
and other nucleic acid methods, such as ligase chain
reaction (LCR) and nucleic acid sequence based
amplification (NASBA), can be used to detect malignant
cells for diagnosis and monitoring of various malignancies.
For example, reverse-transcriptase PCR (RT-PCR) is a
powerful technique which can be used to detect the presence
of a specific mRNA population in a complex mixture of
thousands of other mRNA species. In RT-PCR, an mRNA
species is first reverse transcribed to complementary DNA
(cDNA) with use of the enzyme reverse transcriptase; the
cDNA is then amplified as in a standard PCR reaction.
RT-PCR can thus reveal by amplification the presence of a
single species of mRNA. Accordingly, if the mRNA is highly
specific for the cell that produces it, RT-PCR can be used
to identify the presence of a specific type of cell.
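
A probe hybridizes to an mRNA when its sequence is the reverse complement of the target. The following sketch, offered for illustration only, derives such a DNA probe from an mRNA fragment; the function names are hypothetical, only the unambiguous bases A, C, G, T and U are accepted, and hybridization stringency is not modeled.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence of A, C, G and T."""
    dna = dna.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G and T")
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_probe_for(mrna):
    """Return a DNA probe complementary to an mRNA fragment (U read as T)."""
    return reverse_complement(mrna.upper().replace("U", "T"))
```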

Hybridization to clones or oligonucleotides arrayed
on a solid support (i.e. gridding) can be used to both
detect the expression of and quantitate the level of
expression of a gene. In this approach, a cDNA encoding
the LSG gene is fixed to a substrate. The substrate may be
of any suitable type including but not limited to glass,
nitrocellulose, nylon or plastic. At least a portion of
the DNA encoding the LSG gene is attached to the substrate
and then incubated with the analyte, which may be RNA or a
complementary DNA (cDNA) copy of the RNA, isolated from the
tissue of interest. Hybridization between the substrate
bound DNA and the analyte can be detected and quantitated
by several means including but not limited to radioactive
labeling or fluorescence labeling of the analyte or a
secondary molecule designed to detect the hybrid.
Quantitation of the level of gene expression can be done by
comparison of the intensity of the signal from the analyte
with that determined from known standards. The
standards can be obtained by in vitro transcription of the
target gene, quantitating the yield, and then using that
material to generate a standard curve.
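
One way to turn signal intensities from known standards into a standard curve is an ordinary least-squares line, sketched below. A linear relation between the amount of standard and the signal intensity is an assumption made for illustration, as are the function names; the text does not prescribe a particular curve-fitting method.

```python
def fit_line(amounts, intensities):
    """Fit intensity = slope * amount + intercept by least squares."""
    if len(amounts) != len(intensities) or len(amounts) < 2:
        raise ValueError("need at least two paired standards")
    count = len(amounts)
    mean_x = sum(amounts) / count
    mean_y = sum(intensities) / count
    spread = sum((x - mean_x) ** 2 for x in amounts)
    if spread == 0:
        raise ValueError("standards must span more than one amount")
    slope = sum((x - mean_x) * (y - mean_y)
                for x, y in zip(amounts, intensities)) / spread
    intercept = mean_y - slope * mean_x
    return slope, intercept


def amount_from_intensity(intensity, slope, intercept):
    """Invert the fitted line to estimate the amount of analyte."""
    if slope == 0:
        raise ValueError("a flat standard curve cannot be inverted")
    return (intensity - intercept) / slope
```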

Of the proteomic approaches, 2D electrophoresis is a
technique well known to those in the art. Isolation of
individual proteins from a sample such as serum is
accomplished using sequential separation of proteins by
different characteristics usually on polyacrylamide gels.
First, proteins are separated by size using an electric
current. The current acts uniformly on all proteins, so
smaller proteins move farther on the gel than larger
proteins. The second dimension applies a current
perpendicular to the first and separates proteins not on
the basis of size but on the specific electric charge
carried by each protein. Since no two proteins with
different sequences are identical on the basis of both size
and charge, the result of a 2D separation is a square gel
in which each protein occupies a unique spot. Analysis of
the spots with chemical or antibody probes, or subsequent
protein microsequencing can reveal the relative abundance
of a given protein and the identity of the proteins in the
sample.

The above tests can be carried out on samples derived 
from a variety of cells, bodily fluids and/or tissue 
extracts such as homogenates or solubilized tissue obtained
from a patient. Tissue extracts are obtained routinely
from tissue biopsy and autopsy material. Bodily fluids
useful in the present invention include blood, urine,
saliva or any other bodily secretion or derivative thereof.
By blood it is meant to include whole blood, plasma, serum
or any derivative of blood. 

In Vivo Targeting of LSG/Lung Cancer Therapy 

Identification of this LSG is also useful in the
rational design of new therapeutics for imaging and
treating cancers, and in particular lung cancer. For
example, in one embodiment, antibodies which specifically
bind to LSG can be raised and used in vivo in patients
suspected of suffering from lung cancer. Antibodies which
specifically bind LSG can be injected into a patient
suspected of having lung cancer for diagnostic and/or
therapeutic purposes. Thus, another aspect of the present
invention provides for a method for preventing the onset
of, and for treating, lung cancer in a human patient in need
of such treatment by administering to the patient an effective
amount of antibody. By "effective amount" it is meant the
amount or concentration of antibody needed to bind to the
target antigens expressed on the tumor to cause tumor
shrinkage for surgical removal, or disappearance of the
tumor. The binding of the antibody to the overexpressed
LSG is believed to cause the death of the cancer cell
expressing such LSG. The preparation and use of antibodies
for in vivo diagnosis and treatment is well known in the
art. For example, antibody-chelators labeled with
Indium-111 have been described for use in the
radioimmunoscintigraphic imaging of carcinoembryonic
antigen expressing tumors (Sumerdon et al. Nucl. Med. Biol.
1990 17:247-254). In particular, these antibody-chelators
have been used in detecting tumors in patients suspected of
having recurrent colorectal cancer (Griffin et al. J. Clin.
Onc. 1991 9:631-640). Antibodies with paramagnetic ions as
labels for use in magnetic resonance imaging have also been
described (Lauffer, R.B. Magnetic Resonance in Medicine
1991 22:339-342). Antibodies directed against LSG can be
used in a similar manner. Labeled antibodies which
specifically bind LSG can be injected into patients
suspected of having lung cancer for the purpose of
diagnosing or staging of the disease status of the patient.
The label used will be selected in accordance with the
imaging modality to be used. For example, radioactive
labels such as Indium-111, Technetium-99m or Iodine-131 can
be used for planar scans or single photon emission computed
tomography (SPECT). Positron emitting labels such as
Fluorine-18 can be used in positron emission tomography.
Paramagnetic ions such as Gadolinium (III) or Manganese (II)
can be used in magnetic resonance imaging (MRI). Presence
of the label, as compared to imaging of normal tissue,
permits determination of the spread of the cancer. The
amount of label within an organ or tissue also allows
determination of the presence or absence of cancer in that
organ or tissue.

Antibodies which can be used in in vivo methods
include polyclonal, monoclonal and omniclonal antibodies
and antibodies prepared via molecular biology techniques.
Antibody fragments and aptamers and single-stranded
oligonucleotides such as those derived from an in vitro
evolution protocol referred to as SELEX and well known to
those skilled in the art can also be used.

Screening Assays

The present invention also provides methods for 

identifying modulators which bind to LSG protein or have a
modulatory effect on the expression or activity of LSG
protein. Modulators which decrease the expression or
activity of LSG protein are believed to be useful in
treating lung cancer. Such screening assays are known to
those of skill in the art and include, without limitation,
cell-based assays and cell free assays. 

Small molecules predicted via computer imaging to 
specifically bind to regions of LSG can also be designed, 
synthesized and tested for use in the imaging and treatment
of lung cancer. Further, libraries of molecules can be
screened for potential anticancer agents by assessing the
ability of the molecule to bind to the LSGs identified
herein. Molecules identified in the library as being
capable of binding to LSG are key candidates for further
evaluation for use in the treatment of lung cancer. In a
preferred embodiment, these molecules will downregulate
expression and/or activity of LSG in cells.

Adoptive Immunotherapy and Vaccines

Adoptive immunotherapy of cancer refers to a
therapeutic approach in which immune cells with an
antitumor reactivity are administered to a tumor-bearing
host, with the aim that the cells mediate, either directly
or indirectly, the regression of an established tumor.
Transfusion of lymphocytes, particularly T lymphocytes,
falls into this category and investigators at the National
Cancer Institute (NCI) have used autologous reinfusion of
peripheral blood lymphocytes or tumor-infiltrating
lymphocytes (TIL), T cell cultures from biopsies of
subcutaneous lymph nodules, to treat several human cancers
(Rosenberg, S. A., U.S. Patent No. 4,690,914, issued Sep.
1, 1987; Rosenberg, S. A., et al., 1988, N. England J. Med.
319:1676-1680).

The present invention relates to compositions and
methods of adoptive immunotherapy for the prevention and/or
treatment of primary and metastatic lung cancer in humans
using macrophages sensitized to the antigenic LSG
molecules, with or without non-covalent complexes of heat
shock protein (hsp). Antigenicity or immunogenicity of the
LSG is readily confirmed by the ability of the LSG protein
or a fragment thereof to raise antibodies or educate naive
effector cells, which in turn lyse target cells expressing
the antigen (or epitope).

Cancer cells are, by definition, abnormal and contain 
proteins which should be recognized by the immune system as
foreign since they are not present in normal tissues.
However, the immune system often seems to ignore this
abnormality and fails to attack tumors. The foreign LSG
proteins that are produced by the cancer cells can be used
to reveal their presence. The LSG is broken into short
fragments, called tumor antigens, which are displayed on
the surface of the cell. These tumor antigens are held or
presented on the cell surface by molecules called MHC, of
which there are two types: class I and II. Tumor antigens
in association with MHC class I molecules are recognized by
cytotoxic T cells while antigen-MHC class II complexes are
recognized by a second subset of T cells called helper
cells. These cells secrete cytokines which slow or stop
tumor growth and help another type of white blood cell, B
cells, to make antibodies against the tumor cells.

In adoptive immunotherapy, T cells or antigen
presenting cells (APCs) are stimulated outside the body (ex
vivo), using the tumor specific LSG antigen. The
stimulated cells are then reinfused into the patient where
they attack the cancerous cells. Research has shown that
using both cytotoxic and helper T cells is far more
effective than using either subset alone. Additionally,
the LSG antigen may be complexed with heat shock proteins
to stimulate the APCs as described in U.S. Patent No.
5,985,270.

The APCs can be selected from among those antigen 
presenting cells known in the art including, but not 
limited to, macrophages, dendritic cells, B lymphocytes, 
and a combination thereof, and are preferably macrophages.
In a preferred use, wherein cells are autologous to the
individual, autologous immune cells such as lymphocytes,
macrophages or other APCs are used to circumvent the issue
of whom to select as the donor of the immune cells for
adoptive transfer. Another problem circumvented by use of
autologous immune cells is graft versus host disease which
can be fatal if unsuccessfully treated. 

In adoptive immunotherapy with gene therapy, DNA of 
the LSG can be introduced into effector cells similarly as 
in conventional gene therapy. This can enhance the
cytotoxicity of the effector cells to tumor cells as they
have been manipulated to produce the antigenic protein 
resulting in improvement of the adoptive immunotherapy. 

LSG antigens of this invention are also useful as
components of lung cancer vaccines. The vaccine comprises
an immunogenically stimulatory amount of a LSG antigen.
Immunogenically stimulatory amount refers to that amount of
antigen that is able to invoke the desired immune response
in the recipient for the amelioration or treatment of lung
cancer. Effective amounts may be determined empirically by
standard procedures well known to those skilled in the art.

The LSG antigen may be provided in any one of a
number of vaccine formulations which are designed to induce
the desired type of immune response, e.g., antibody and/or
cell mediated. Such formulations are known in the art and
include, but are not limited to, formulations such as those
described in U.S. Patent 5,585,103. Vaccine formulations
of the present invention used to stimulate immune responses
can also include pharmaceutically acceptable adjuvants.

Vectors, host cells, expression

The present invention also relates to vectors which
include polynucleotides of the present invention, host
cells which are genetically engineered with vectors of the
invention and the production of polypeptides of the
invention by recombinant techniques.

Host cells can be genetically engineered to
incorporate LSG polynucleotides and express LSG
polypeptides of the present invention. For instance, LSG
polynucleotides may be introduced into host cells using
well known techniques of infection, transduction,
transfection, transvection and transformation. The LSG
polynucleotides may be introduced alone or with other
polynucleotides. Such other polynucleotides may be
introduced independently, co-introduced or introduced
joined to the LSG polynucleotides of the invention.

For example, LSG polynucleotides of the invention may
be transfected into host cells with another, separate,
polynucleotide encoding a selectable marker, using standard
techniques for co-transfection and selection in, for
instance, mammalian cells. In this case, the
polynucleotides generally will be stably incorporated into
the host cell genome.

Alternatively, the LSG polynucleotide may be joined
to a vector containing a selectable marker for propagation
in a host. The vector construct may be introduced into
host cells by the aforementioned techniques. Generally, a
plasmid vector is introduced as DNA in a precipitate, such
as a calcium phosphate precipitate, or in a complex with a
charged lipid. Electroporation also may be used to
introduce LSG polynucleotides into a host. If the vector
is a virus, it may be packaged in vitro or introduced into
a packaging cell and the packaged virus may be transduced
into cells. A wide variety of well known techniques
conducted routinely by those of skill in the art are
suitable for making LSG polynucleotides and for introducing
LSG polynucleotides into cells in accordance with this
aspect of the invention. Such techniques are reviewed at
length in reference texts such as Sambrook et al.,
previously cited herein.

Vectors which may be used in the present invention
include, for example, plasmid vectors, single- or
double-stranded phage vectors, and single- or double-stranded RNA
or DNA viral vectors. Such vectors may be introduced into
cells as polynucleotides, preferably DNA, by well known
techniques for introducing DNA and RNA into cells. The
vectors, in the case of phage and viral vectors, also may
be and preferably are introduced into cells as packaged or
encapsidated virus by well known techniques for infection
and transduction. Viral vectors may be replication
competent or replication defective. In the latter case
viral propagation generally will occur only in
complementing host cells.

Preferred vectors for expression of polynucleotides
and polypeptides of the present invention include, but are
not limited to, vectors comprising cis-acting control
regions effective for expression in a host operatively
linked to the polynucleotide to be expressed. Appropriate
trans-acting factors either are supplied by the host,
supplied by a complementing vector or supplied by the
vector itself upon introduction into the host.

In certain preferred embodiments in this regard, the
vectors provide for specific expression. Such specific
expression may be inducible expression, expression only
in certain types of cells, or both inducible and
cell-specific expression. Particularly preferred among inducible vectors
are vectors that can be induced to express by environmental
factors that are easy to manipulate, such as temperature
and nutrient additives. A variety of vectors suitable to
this aspect of the invention, including constitutive and
inducible expression vectors for use in prokaryotic and
eukaryotic hosts, are well known and employed routinely by
those of skill in the art.

The engineered host cells can be cultured in
conventional nutrient media which may be modified as
appropriate for, inter alia, activating promoters,
selecting transformants or amplifying genes. Culture
conditions such as temperature, pH and the like, previously
used with the host cell selected for expression, generally
will be suitable for expression of LSG polypeptides of the
present invention.

A great variety of expression vectors can be used to
express LSG polypeptides of the invention. Such vectors
include chromosomal, episomal and virus-derived vectors.
Vectors may be derived from bacterial plasmids, from
bacteriophage, from yeast episomes, from yeast chromosomal
elements, from viruses such as baculoviruses, papova
viruses, such as SV40, vaccinia viruses, adenoviruses, fowl
pox viruses, pseudorabies viruses and retroviruses, and
from combinations thereof such as those derived from
plasmid and bacteriophage genetic elements, such as cosmids
and phagemids. All may be used for expression in accordance
with this aspect of the present invention. Generally, any
vector suitable to maintain, propagate or express
polynucleotides to express a polypeptide in a host may be
used for expression in this regard.

The appropriate DNA sequence may be inserted into the
vector by any of a variety of well-known and routine
techniques. In general, a DNA sequence for expression is
joined to an expression vector by cleaving the DNA sequence
and the expression vector with one or more restriction
endonucleases and then joining the restriction fragments
together using T4 DNA ligase. Procedures for restriction
and ligation that can be used to this end are well known
and routine to those of skill. Suitable procedures in this
regard, and for constructing expression vectors using
alternative techniques, which also are well known and
routine to those of skill, are set forth in great detail in
Sambrook et al. cited elsewhere herein.
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The DNA sequence in the expression vector is 
operatively linked to appropriate expression control 
sequence (s), including, for instance, a promoter to direct 
mRNA transcription. Representative promoters include the 
5 phage lambda PL promoter, the E. coli lac, trp and tac 

promoters, the SV40 early and late promoters, and promoters 
of retroviral LTRs, to name just a few of the well-known 
promoters . It will be understood that numerous promoters 
not mentioned are also suitable for use in this aspect of 

10 the invention and are well known and readily may be 

employed by those of skill in the manner illustrated by the 
discussion and the examples herein. 

In general, expression constructs will contain sites
for transcription initiation and termination, and, in the
transcribed region, a ribosome binding site for
translation. The coding portion of the mature transcripts
expressed by the constructs will include a translation
initiating AUG at the beginning and a termination codon
appropriately positioned at the end of the polypeptide to
be translated.

In addition, the constructs may contain control
regions that regulate as well as engender expression.
Generally, in accordance with many commonly practiced
procedures, such regions, for example repressor binding
sites and enhancers, among others, will operate by
controlling transcription.

Vectors for propagation and expression generally will
include selectable markers. Such markers also may be
suitable for amplification or the vectors may contain
additional markers for this purpose. In this regard, the
expression vectors preferably contain one or more
selectable marker genes to provide a phenotypic trait for
selection of transformed host cells. Preferred markers
include dihydrofolate reductase or neomycin resistance for
eukaryotic cell culture, and tetracycline or ampicillin
resistance genes for culturing in E. coli and other
bacteria.

The vector containing the appropriate DNA sequence as
described elsewhere herein, as well as an appropriate
promoter, and other appropriate control sequences, may be
introduced into an appropriate host using a variety of well
known techniques suitable to expression therein of a
desired polypeptide. Representative examples of
appropriate hosts include bacterial cells, such as E. coli,
Streptomyces and Salmonella typhimurium cells; fungal
cells, such as yeast cells; insect cells such as Drosophila
S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS
and Bowes melanoma cells; and plant cells. Hosts for a
great variety of expression constructs are well known, and
those of skill will be enabled by the present disclosure
readily to select a host for expressing a LSG polypeptide
in accordance with this aspect of the present invention.

More particularly, the present invention also
includes recombinant constructs, such as expression
constructs, comprising one or more of the sequences
described above. The constructs comprise a vector, such as
a plasmid or viral vector, into which such LSG sequence of
the invention has been inserted. The sequence may be
inserted in a forward or reverse orientation. In certain
preferred embodiments in this regard, the construct further
comprises regulatory sequences, including, for example, a
promoter, operably linked to the sequence. Large numbers of
suitable vectors and promoters are known to those of skill
in the art, and there are many commercially available
vectors suitable for use in the present invention.

The following vectors, which are commercially
available, are provided by way of example. Among vectors
preferred for use in bacteria are pQE70, pQE60 and pQE-9,
available from Qiagen; pBS vectors, Phagescript vectors,
Bluescript vectors, pNH8A, pNH16a, pNH18A, pNH46A,
available from Stratagene; and ptrc99a, pKK223-3, pKK233-3,
pDR540, pRIT5 available from Pharmacia. Among preferred
eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXT1 and pSG
available from Stratagene; and pSVK3, pBPV, pMSG and pSVL
available from Pharmacia. These vectors are listed solely
by way of illustration of the many commercially available
and well known vectors that are available to those of skill
in the art for use in accordance with this aspect of the
present invention. It will be appreciated by those of
skill in the art upon reading this disclosure that any
other plasmid or vector suitable for introduction,
maintenance, propagation and/or expression of a LSG
polynucleotide or polypeptide of the invention in a host
may be used in this aspect of the invention.

Promoter regions can be selected from any desired
gene using vectors that contain a reporter transcription
unit lacking a promoter region, such as a chloramphenicol
acetyl transferase ("cat") transcription unit, downstream
of a restriction site or sites for introducing a candidate
promoter fragment; i.e., a fragment that may contain a
promoter. As is well known, introduction into the vector
of a promoter-containing fragment at the restriction site
upstream of the cat gene engenders production of CAT
activity detectable by standard CAT assays. Vectors
suitable to this end are well known and readily available.
Two such vectors are pKK232-8 and pCM7. Thus, promoters
for expression of LSG polynucleotides of the present
invention include not only well known and readily
available promoters, but also promoters that readily may be
obtained by the foregoing technique, using a reporter gene.

Among known bacterial promoters suitable for
expression of polynucleotides and polypeptides in
accordance with the present invention are the E. coli lacI
and lacZ promoters, the T3 and T7 promoters, the gpt
promoter, the lambda PR, PL promoters and the trp promoter.
Among known eukaryotic promoters suitable in this regard
are the CMV immediate early promoter, the HSV thymidine
kinase promoter, the early and late SV40 promoters, the
promoters of retroviral LTRs, such as those of the Rous
sarcoma virus ("RSV"), and metallothionein promoters, such
as the mouse metallothionein-I promoter.

Selection of appropriate vectors and promoters for
expression in a host cell is a well known procedure and the
requisite techniques for expression vector construction,
introduction of the vector into the host and expression in
the host are routine skills in the art.

The present invention also relates to host cells
containing the above-described constructs. The host cell
can be a higher eukaryotic cell, such as a mammalian cell,
or a lower eukaryotic cell, such as a yeast cell.
Alternatively, the host cell can be a prokaryotic cell,
such as a bacterial cell.

Introduction of the construct into the host cell can
be effected by calcium phosphate transfection, DEAE-dextran
mediated transfection, cationic lipid-mediated
transfection, electroporation, transduction, infection or
other methods. Such methods are described in many standard
laboratory manuals, such as Davis et al. BASIC METHODS IN
MOLECULAR BIOLOGY, (1986).

Constructs in host cells can be used in a
conventional manner to produce the gene product encoded by
the recombinant sequence. Alternatively, LSG polypeptides
of the invention can be synthetically produced by
conventional peptide synthesizers.

Mature proteins can be expressed in mammalian cells,
yeast, bacteria, or other cells under the control of
appropriate promoters. Cell-free translation systems can
also be employed to produce such proteins using RNAs
derived from the DNA constructs of the present invention.
Appropriate cloning and expression vectors for use with
prokaryotic and eukaryotic hosts are described by Sambrook
et al. cited elsewhere herein.

Generally, recombinant expression vectors will
include origins of replication, a promoter derived from a
highly-expressed gene to direct transcription of a
downstream structural sequence, and a selectable marker to
permit isolation of vector-containing cells after exposure
to the vector. Among suitable promoters are those derived
from the genes that encode glycolytic enzymes such as
3-phosphoglycerate kinase ("PGK"), α-factor, acid
phosphatase, and heat shock proteins, among others.
Selectable markers include the ampicillin resistance gene
of E. coli and the TRP1 gene of S. cerevisiae.

Transcription of DNA encoding the LSG polypeptides of
the present invention by higher eukaryotes may be increased
by inserting an enhancer sequence into the vector.
Enhancers are cis-acting elements of DNA, usually from
about 10 to 300 base pairs (bp), that act to increase
transcriptional activity of a promoter in a given host
cell-type. Examples of enhancers include the SV40
enhancer, which is located on the late side of the
replication origin at bp 100 to 270, the cytomegalovirus
early promoter enhancer, the polyoma enhancer on the late
side of the replication origin, and adenovirus enhancers.

A polynucleotide of the present invention, encoding a
heterologous structural sequence of a LSG polypeptide of
the present invention, generally will be inserted into the
vector using standard techniques so that it is operably
linked to the promoter for expression. The polynucleotide
will be positioned so that the transcription start site is
located appropriately 5' to a ribosome binding site. The
ribosome binding site will be 5' to the AUG that initiates
translation of the polypeptide to be expressed. Generally,
there will be no other open reading frames that begin with
an initiation codon, usually AUG, lying between the
ribosome binding site and the initiating AUG. Also,
generally, there will be a translation stop codon at the
end of the polypeptide and there will be a polyadenylation
signal and a transcription termination signal appropriately
disposed at the 3' end of the transcribed region.
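
The layout rules in the preceding paragraph can be checked
mechanically on a candidate construct. The following is a
minimal sketch only, not a method taken from this
disclosure. The function name, the DNA-alphabet input and
the two coordinates (rbs_end, start) are hypothetical
inputs chosen for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_construct(transcribed, rbs_end, start):
    """Return a list of layout problems in a transcribed region.

    transcribed: sequence in DNA alphabet (ATG stands for AUG).
    rbs_end:     index just past the ribosome binding site (assumed known).
    start:       index of the A of the initiating ATG (assumed known).
    """
    seq = transcribed.upper()
    problems = []
    # The ribosome binding site must lie 5' to the initiating codon.
    if rbs_end > start:
        problems.append("ribosome binding site is not 5' to the initiating ATG")
    if seq[start:start + 3] != "ATG":
        problems.append("no ATG at the stated start position")
    # No other initiation codon should lie between the RBS and the start.
    if "ATG" in seq[rbs_end:start]:
        problems.append("additional ATG between ribosome binding site and start")
    # Walk the reading frame codon by codon looking for a stop codon.
    has_stop = any(
        seq[i:i + 3] in STOP_CODONS for i in range(start, len(seq) - 2, 3)
    )
    if not has_stop:
        problems.append("no in-frame stop codon")
    return problems
```

An empty list means that none of the checked rules is
violated. Polyadenylation and transcription termination
signals are not examined by this sketch.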

Appropriate secretion signals may be incorporated
into the expressed polypeptide for secretion of the
translated protein into the lumen of the endoplasmic
reticulum, into the periplasmic space or into the
extracellular environment. The signals may be endogenous
to the polypeptide or they may be heterologous signals.

The polypeptide may be expressed in a modified form,
such as a fusion protein, and may include not only
secretion signals but also additional heterologous
functional regions. Thus, for instance, a region of
additional amino acids, particularly charged amino acids,
may be added to the N-terminus of the polypeptide to
improve stability and persistence in the host cell during
purification or during subsequent handling and storage. A
region also may be added to the polypeptide to facilitate
purification. Such regions may be removed prior to final
preparation of the polypeptide. The addition of peptide
moieties to polypeptides to engender secretion or
excretion, to improve stability and to facilitate
purification, among others, are familiar and routine
techniques in the art.

Suitable prokaryotic hosts for propagation,
maintenance or expression of LSG polynucleotides and
polypeptides in accordance with the invention include
Escherichia coli, Bacillus subtilis and Salmonella
typhimurium. Various species of Pseudomonas, Streptomyces,
and Staphylococcus are suitable hosts in this regard. Many
other hosts known to those of skill may also be
employed in this regard.




As a representative, but non-limiting example, useful
expression vectors for bacterial use can comprise a
selectable marker and bacterial origin of replication
derived from commercially available plasmids comprising
genetic elements of the well known cloning vector pBR322.
Such commercial vectors include, for example, pKK223-3
(Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1
(Promega Biotec, Madison, Wis., USA). These pBR322
"backbone" sections are combined with an appropriate
promoter and the structural sequence to be expressed.
Following transformation of a suitable host strain and
growth of the host strain to an appropriate cell density,
where the selected promoter is inducible it is induced by
appropriate means (e.g., temperature shift or exposure to
chemical inducer) and cells are cultured for an additional
period. Cells typically then are harvested by
centrifugation, disrupted by physical or chemical means,
and the resulting crude extract retained for further
purification. Microbial cells employed in expression of
proteins can be disrupted by any convenient method,
including freeze-thaw cycling, sonication, mechanical
disruption, or use of cell lysing agents. Such methods are
well known to those skilled in the art.

Various mammalian cell culture systems can be
employed for expression, as well. An exemplary mammalian
expression system is the COS-7 line of monkey kidney
fibroblasts described in Gluzman et al., Cell 23: 175
(1981). Other mammalian cell lines capable of expressing a
compatible vector include, for example, the C127, 3T3, CHO,
HeLa, human kidney 293 and BHK cell lines. Mammalian
expression vectors comprise an origin of replication, a
suitable promoter and enhancer, and any ribosome binding
sites, polyadenylation sites, splice donor and acceptor
sites, transcriptional termination sequences, and 5'
flanking non-transcribed sequences that are necessary for
expression. In certain preferred embodiments in this regard,
DNA sequences derived from the SV40 splice sites and the
SV40 polyadenylation sites are used for required
non-transcribed genetic elements of these types.

LSG polypeptides can be recovered and purified from
recombinant cell cultures by well-known methods including
ammonium sulfate or ethanol precipitation, acid extraction,
anion or cation exchange chromatography, phosphocellulose
chromatography, hydrophobic interaction chromatography,
affinity chromatography, hydroxylapatite chromatography and
lectin chromatography. Most preferably, high performance
liquid chromatography ("HPLC") is employed for
purification. Well known techniques for refolding proteins
may be employed to regenerate active conformation when the
polypeptide is denatured during isolation and/or
purification.

LSG polypeptides of the present invention include
naturally purified products, products of chemical synthetic
procedures, and products produced by recombinant techniques
from a prokaryotic or eukaryotic host, including, for
example, bacterial, yeast, higher plant, insect and
mammalian cells. Depending upon the host employed in a
recombinant production procedure, the LSG polypeptides of
the present invention may be glycosylated or may be
non-glycosylated. In addition, LSG polypeptides of the
invention may also include an initial modified methionine
residue, in some cases as a result of host-mediated
processes.

LSG polynucleotides and polypeptides may be used in
accordance with the present invention for a variety of
applications, particularly those that make use of the
chemical and biological properties of the LSGs. Additional
applications relate to diagnosis and to treatment of
disorders of cells, tissues and organisms. These aspects of
the invention are illustrated further by the following
discussion.

Polynucleotide assays 

As discussed in some detail supra, this invention is
also related to the use of LSG polynucleotides to detect
complementary polynucleotides, for example, as a
diagnostic reagent. Detection of a mutated form of LSG
associated with a dysfunction will provide a diagnostic
tool that can add to or define a diagnosis of a disease or
susceptibility to a disease which results from
under-expression, over-expression or altered expression of
a LSG, such as, for example, a susceptibility to inherited
lung cancer.

Individuals carrying mutations in a human LSG gene
may be detected at the DNA level by a variety of
techniques. Nucleic acids for diagnosis may be obtained
from a patient's cells, such as from blood, urine, saliva,
tissue biopsy and autopsy material. The genomic DNA may be
used directly for detection or may be amplified
enzymatically using PCR prior to analysis (Saiki et al.,
Nature, 324: 163-166 (1986)). RNA or cDNA may also be used
in a similar manner. As an example, PCR primers
complementary to a LSG polynucleotide of SEQ ID NO: 1, 2,
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, 20 or 74 can be used to identify and analyze LSG
expression and mutations. For example, deletions and
insertions can be detected by a change in size of the
amplified product in comparison to the normal genotype.
Point mutations can be identified by hybridizing amplified
DNA to radiolabeled LSG RNA or alternatively, radiolabeled
LSG antisense DNA sequences. Perfectly matched sequences
can be distinguished from mismatched duplexes by RNase A
digestion or by differences in melting temperatures.
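
The size comparison described above can be sketched in
silico. The code below is an illustration under stated
assumptions, not a protocol from this disclosure: primers
are taken to match the template exactly, sequences are
plain A/C/G/T strings, and all function names are
hypothetical.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon_length(template, fwd_primer, rev_primer):
    """Length of the product bounded by both primers, or None if no product."""
    t = template.upper()
    fwd = fwd_primer.upper()
    # The reverse primer anneals to the opposite strand, so its site on the
    # given strand is its reverse complement.
    rev_site = reverse_complement(rev_primer)
    start = t.find(fwd)
    if start == -1:
        return None
    end = t.find(rev_site, start + len(fwd))
    if end == -1:
        return None
    return end + len(rev_site) - start


def classify_size_change(reference, sample, fwd_primer, rev_primer):
    """Compare amplified product size in a sample with the normal genotype."""
    ref_len = amplicon_length(reference, fwd_primer, rev_primer)
    smp_len = amplicon_length(sample, fwd_primer, rev_primer)
    if ref_len is None or smp_len is None:
        return "no product"
    if smp_len < ref_len:
        return f"deletion of {ref_len - smp_len} bp"
    if smp_len > ref_len:
        return f"insertion of {smp_len - ref_len} bp"
    return "no size change"
```

A result of "no size change" does not exclude a point
mutation, which is why the text turns to hybridization,
RNase A digestion and melting temperature for that case.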

Sequence differences between a reference gene and
genes having mutations also may be revealed by direct DNA
sequencing. In addition, cloned DNA segments may be
employed as probes to detect specific DNA segments. The
sensitivity of such methods can be greatly enhanced by
appropriate use of PCR or another amplification method.
For example, a sequencing primer is used with
double-stranded PCR product or a single-stranded template
molecule generated by a modified PCR. The sequence
determination is performed by conventional procedures with
radiolabeled nucleotide or by automatic sequencing
procedures with fluorescent tags.

Genetic testing based on DNA sequence differences may
be achieved by detection of alterations in electrophoretic
mobility of DNA fragments in gels, with or without
denaturing agents. Small sequence deletions and insertions
can be visualized by high resolution gel electrophoresis.
DNA fragments of different sequences may be distinguished
on denaturing formamide gradient gels in which the
mobilities of different DNA fragments are retarded in the
gel at different positions according to their specific
melting or partial melting temperatures (see, e.g., Myers
et al., Science, 230: 1242 (1985)).

Sequence changes at specific locations also may be
revealed by nuclease protection assays, such as RNase and
S1 protection or the chemical cleavage method (e.g., Cotton
et al., Proc. Natl. Acad. Sci., USA, 85: 4397-4401 (1985)).

Thus, the detection of a specific DNA sequence may be
achieved by methods such as hybridization, RNase
protection, chemical cleavage, direct DNA sequencing or the
use of restriction enzymes (e.g., restriction fragment
length polymorphisms ("RFLP")) and Southern blotting of
genomic DNA. In addition to more conventional gel
electrophoresis and DNA sequencing, mutations also can be
detected by in situ analysis.

Chromosome assays

The LSG sequences of the present invention are also
valuable for chromosome identification. There is a need
for identifying particular sites on the chromosome and few
chromosome marking reagents based on actual sequence data
(repeat polymorphisms) are presently available for marking
chromosomal location. Each LSG sequence of the present
invention is specifically targeted to and can hybridize
with a particular location on an individual human
chromosome. Thus, the LSGs can be used in the mapping of
DNAs to chromosomes, an important first step in correlating
sequences with genes associated with disease.

In certain preferred embodiments in this regard, the
cDNA herein disclosed is used to clone genomic DNA of a LSG
of the present invention. This can be accomplished using a
variety of well known techniques and libraries, which
generally are available commercially. The genomic DNA is
used for in situ chromosome mapping using well known
techniques for this purpose.

In some cases, sequences can be mapped to chromosomes
by preparing PCR primers (preferably 15-25 bp) from the
cDNA. Computer analysis of the 3' untranslated region of
the gene is used to rapidly select primers that do not span
more than one exon in the genomic DNA, since primers that
span more than one exon would complicate the amplification
process. These primers are then used for
PCR screening of somatic cell hybrids containing individual
human chromosomes. Only those hybrids containing the human
gene corresponding to the primer will yield an amplified
fragment.
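
The primer selection step can be illustrated with a short
sketch. It assumes that exon boundaries are already known
as half-open (start, end) coordinates on the cDNA, which
is a hypothetical input format, and it only enumerates
candidate positions of 15-25 bp that lie within a single
exon. It does not score primers for melting temperature or
specificity.

```python
def spans_junction(window, exons):
    """True if the (start, end) window is not contained in any single exon."""
    s, e = window
    return not any(ex_s <= s and e <= ex_e for ex_s, ex_e in exons)


def single_exon_primer_windows(exons, min_len=15, max_len=25):
    """List every (start, end) primer window lying inside one exon.

    exons: list of half-open (start, end) coordinates on the cDNA.
    """
    windows = []
    for ex_start, ex_end in exons:
        for length in range(min_len, max_len + 1):
            # The last valid start leaves room for the full primer length.
            for s in range(ex_start, ex_end - length + 1):
                windows.append((s, s + length))
    return windows
```

For two hypothetical exons at (0, 20) and (20, 50), a
window at (10, 30) crosses the junction and is excluded,
while (22, 40) lies within the second exon and is kept.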

PCR mapping of somatic cell hybrids is a rapid
procedure for assigning a particular DNA to a particular
chromosome. Using the present invention with the same
oligonucleotide primers, sublocalization can be achieved
with panels of fragments from specific chromosomes or pools
of large genomic clones in an analogous manner. Other
mapping strategies that can similarly be used to map a
sequence to its chromosome include in situ hybridization,
prescreening with labeled flow-sorted chromosomes and
preselection by hybridization to construct
chromosome-specific cDNA libraries.

Fluorescence in situ hybridization ("FISH") of a cDNA
clone to a metaphase chromosomal spread can be used to
provide a precise chromosomal location in one step. This
technique can be used with cDNA as short as 50 or 60 bp.
This technique is described by Verma et al. (HUMAN
CHROMOSOMES: A MANUAL OF BASIC TECHNIQUES, Pergamon Press,
New York (1988)).

Once a sequence has been mapped to a precise
chromosomal location, the physical position of the sequence
on the chromosome can be correlated with genetic map data.
Such data are found, for example, in V. McKusick, MENDELIAN
INHERITANCE IN MAN, available on line through Johns Hopkins
University, Welch Medical Library. The relationship
between genes and diseases that have been mapped to the
same chromosomal region is then identified through linkage
analysis (coinheritance of physically adjacent genes).

Next, it is necessary to determine the differences in
the cDNA or genomic sequence between affected and
unaffected individuals. If a mutation is observed in some
or all of the affected individuals but not in any normal
individuals, then the mutation is likely to be the
causative agent of the disease.

With current resolution of physical mapping and
genetic mapping techniques, a cDNA precisely localized to a
chromosomal region associated with the disease could be one
of between 50 and 500 potential causative genes. (This
assumes 1 megabase mapping resolution and one gene per 20
kb).
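
The arithmetic behind this estimate is simple division and
can be sketched as follows. The function name is
hypothetical. The stated assumptions (1 megabase, one gene
per 20 kb) give the lower figure of 50; the upper figure
of 500 would correspond to a 10 megabase interval at the
same gene density, which is an inference here and is not
stated in the text.

```python
def candidate_genes(region_bp, bp_per_gene=20_000):
    """Estimate the number of genes in a mapped interval.

    Assumes a uniform density of one gene per bp_per_gene base pairs.
    """
    return region_bp // bp_per_gene


lower = candidate_genes(1_000_000)    # 1 megabase resolution
upper = candidate_genes(10_000_000)   # assumed 10 megabase interval
```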

Polypeptide assays 

As described in some detail supra, the present
invention also relates to diagnostic assays such as
quantitative and diagnostic assays for detecting levels of
LSG polypeptide in cells and tissues, and biological fluids
such as blood and urine, including determination of normal
and abnormal levels. Thus, for instance, a diagnostic assay
in accordance with the present invention for detecting
over-expression or under-expression of a LSG polypeptide
compared to normal control tissue samples may be used to
detect the presence of neoplasia. Assay techniques that
can be used to determine levels of a protein, such as a LSG
polypeptide of the present invention, in a sample derived
from a host are well-known to those of skill in the art.
Such assay methods include radioimmunoassays,
competitive-binding assays, Western Blot analysis and ELISA
assays. Among these, ELISAs frequently are preferred.

For example, antibody-sandwich ELISAs are used to
detect polypeptides in a sample, preferably a biological
sample. Wells of a microtiter plate are coated with
specific antibodies, at a final concentration of 0.2 to 10
µg/ml. The antibodies are either monoclonal or polyclonal
and are produced by methods as described herein. The wells
are blocked so that non-specific binding of the polypeptide
to the well is reduced. The coated wells are then
incubated for >2 hours at room temperature with a sample
containing the LSG polypeptide. Preferably, serial
dilutions of the sample should be used to validate results.
The plates are then washed three times with deionized or
distilled water to remove unbound polypeptide. Next, 50
µl of specific antibody-alkaline phosphatase conjugate, at
a concentration of 25-400 ng, is added and incubated for 2
hours at room temperature. The plates are again washed
three times with deionized or distilled water to remove
unbound conjugate. 4-methylumbelliferyl phosphate (MUP)
or p-nitrophenyl phosphate (NPP) substrate solution (75 µl)
is then added to each well and the plate is incubated 1
hour at room temperature. The reaction is measured by a
microtiter plate reader. A standard curve is prepared using
serial dilutions of a control sample, and polypeptide
concentration is plotted on the X-axis (log scale) while
fluorescence or absorbance is plotted on the Y-axis (linear
scale). The concentration of the LSG polypeptide in the
sample is interpolated using the standard curve.
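
The interpolation step can be sketched as below. This is
one simple way to read a value off a standard curve with a
log-scale concentration axis and a linear signal axis. It
uses piecewise-linear interpolation between adjacent
standards, which is an assumption of this sketch; the text
does not prescribe a curve-fitting method. The function
name and the example numbers are hypothetical.

```python
import math


def interpolate_concentration(standards, signal):
    """Estimate concentration from a plate-reader signal.

    standards: list of (concentration, signal) pairs from serial dilutions
               of a control sample; concentrations must be positive.
    signal:    fluorescence or absorbance measured for the unknown sample.
    Returns None when the signal lies outside the range of the standards.
    """
    # X-axis is log10(concentration); Y-axis is the raw signal.
    pts = sorted((math.log10(c), s) for c, s in standards)
    for (x0, y0), (x1, y1) in zip(pts, pts[1:]):
        lo, hi = min(y0, y1), max(y0, y1)
        if lo <= signal <= hi and y1 != y0:
            x = x0 + (signal - y0) * (x1 - x0) / (y1 - y0)
            return 10 ** x
    return None
```

With hypothetical standards of (1, 0.1), (10, 0.5) and
(100, 0.9), a signal midway between the first two
standards maps to the geometric mean of their
concentrations, as expected for a log-scale X-axis.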

Antibodies

As discussed in some detail supra, LSG polypeptides,
their fragments or other derivatives, or analogs thereof,
or cells expressing them can be used as an immunogen to
produce antibodies thereto. These antibodies can be
polyclonal or monoclonal antibodies. The present invention
also includes chimeric, single chain, and humanized
antibodies, as well as Fab fragments, or the product of an
Fab expression library. Various procedures known in the art
may be used for the production of such antibodies and
fragments.

A variety of methods for antibody production are set 
forth in Current Protocols, Chapter 2. 

For example, cells expressing a LSG polypeptide of
the present invention can be administered to an animal to
induce the production of sera containing polyclonal
antibodies. In a preferred method, a preparation of the
secreted protein is prepared and purified to render it
substantially free of natural contaminants. This
preparation is then introduced into an animal in order to
produce polyclonal antisera of greater specific activity.
The antibody obtained will bind with the LSG polypeptide
itself. In this manner, even a sequence encoding only a
fragment of the LSG polypeptide can be used to generate
antibodies binding the whole native polypeptide. Such
antibodies can then be used to isolate the LSG polypeptide
from tissue expressing that LSG polypeptide.

Alternatively, monoclonal antibodies can be prepared.
Examples of techniques for production of monoclonal
antibodies include, but are not limited to, the hybridoma
technique (Kohler, G. and Milstein, C., Nature 256: 495-497
(1975)), the trioma technique, and the human B-cell
hybridoma technique (Kozbor et al., Immunology Today 4: 72
(1983); Cole et al., pp. 77-96 in MONOCLONAL ANTIBODIES AND
CANCER THERAPY, Alan R. Liss, Inc. (1985)). The
EBV-hybridoma technique is useful in production of human
monoclonal antibodies.

Hybridoma technologies have also been described by
Köhler et al. (Eur. J. Immunol. 6: 511 (1976)), Köhler et
al. (Eur. J. Immunol. 6: 292 (1976)) and Hammerling et al.
(in: Monoclonal Antibodies and T-Cell Hybridomas, Elsevier,
N.Y., pp. 563-681 (1981)). In general, such procedures
involve immunizing an animal (preferably a mouse) with LSG
polypeptide or, more preferably, with a secreted LSG
polypeptide-expressing cell. Such cells may be cultured in
any suitable tissue culture medium; however, it is
preferable to culture cells in Earle's modified Eagle's
medium supplemented with 10% fetal bovine serum
(inactivated at about 56°C), and supplemented with about 10
g/l of nonessential amino acids, about 1,000 U/ml of
penicillin, and about 100 µg/ml of streptomycin. The
splenocytes of such mice are extracted and fused with a
suitable myeloma cell line. Any suitable myeloma cell line
may be employed in accordance with the present invention;
however, it is preferable to employ the parent myeloma cell
line (SP20), available from the ATCC. After fusion, the
resulting hybridoma cells are selectively maintained in HAT
medium, and then cloned by limiting dilution as described
by Wands et al. (Gastroenterology 80: 225-232 (1981)).
The hybridoma cells obtained through such a selection are
then assayed to identify clones which secrete antibodies
capable of binding the polypeptide.

Alternatively, additional antibodies capable of
binding to the polypeptide can be produced in a two-step
procedure using anti-idiotypic antibodies. Such a method
makes use of the fact that antibodies are themselves
antigens, and therefore, it is possible to obtain an
antibody which binds to a second antibody. In accordance
with this method, protein-specific antibodies are used to
immunize an animal, preferably a mouse. The splenocytes of
such an animal are then used to produce hybridoma cells,
and the hybridoma cells are screened to identify clones
which produce an antibody whose ability to bind to the
protein-specific antibody can be blocked by the
polypeptide. Such antibodies comprise anti-idiotypic
antibodies to the protein-specific antibody and can be used
to immunize an animal to induce formation of further
protein-specific antibodies.

Techniques described for the production of single
chain antibodies (U.S. Patent 4,946,778) can also be
adapted to produce single chain antibodies to immunogenic
polypeptide products of this invention. Also, transgenic
mice, as well as other nonhuman transgenic animals, may be
used to express humanized antibodies to immunogenic
polypeptide products of this invention.

It will be appreciated that Fab, F(ab')2 and other
fragments of the antibodies of the present invention may
also be used according to the methods disclosed herein.
Such fragments are typically produced by proteolytic
cleavage, using enzymes such as papain (to produce Fab
fragments) or pepsin (to produce F(ab')2 fragments).
Alternatively, secreted protein-binding fragments can be
produced through the application of recombinant DNA
technology or through synthetic chemistry.

For in vivo use of antibodies in humans, it may be
preferable to use "humanized" chimeric monoclonal
antibodies. Such antibodies can be produced using genetic
constructs derived from hybridoma cells producing the
monoclonal antibodies described above. Methods for
producing chimeric antibodies are known in the art (See,
for review, Morrison, Science 229: 1202 (1985); Oi et al.,
BioTechniques 4: 214 (1986); Cabilly et al., U.S. Patent
4,816,567; Taniguchi et al., EP 171496; Morrison et al., EP
173494; Neuberger et al., WO 8601533; Robinson et al., WO
8702671; Boulianne et al., Nature 312: 643 (1984);
Neuberger et al., Nature 314: 268 (1985).)

The above-described antibodies may be employed to
isolate or to identify clones expressing LSG polypeptides
or purify LSG polypeptides of the present invention by
attachment of the antibody to a solid support for isolation
and/or purification by affinity chromatography. As
discussed in more detail supra, antibodies specific against
a LSG may also be used to image tumors, particularly cancer
of the lung, in patients suffering from cancer. Such
antibodies may also be used therapeutically to target
tumors expressing a LSG.

Preferred exemplary antigenic epitopes of LSGs of the
present invention which have been identified are depicted
below. The antigenicity index (AI avg) used is
Jameson-Wolf. In some embodiments, it may be preferred to
raise antibodies against these regions of the LSGs.
DEX73_2.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 75)

positions    AI avg    length
16-50        1.06      35

DEX73_JB.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 76)

positions    AI avg    length
52-66        1.05      15

DEX73_5.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 77)

positions    AI avg    length
1419-1433    1.16      15
1387-1414    1.08      28
808-825      1.00      18

DEX73_8.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 78)

positions    AI avg    length
208-223      1.06      16
123-135      1.05      13
689-717      1.03      29
63-90        1.02      28
653-683      1.01      31
366-377      1.00      12

DEX73_12.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 80)

positions    AI avg    length
56-68        1.05      13

DEX73_13.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 81)

positions    AI avg    length
207-217      1.28      11
72-85        1.17      14
405-469      1.15      65
151-171      1.02      21

DEX73_18.aa Antigenicity Index (Jameson-Wolf)
(SEQ ID NO: 84)

positions    AI avg    length
40-51        1.27      12
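
The length column in the tables above is consistent with
inclusive residue numbering (end - start + 1). The short
Python sketch below checks this for the rows listed under
SEQ ID NO: 78 and picks the region with the highest average
index. The function names are illustrative only; the sketch
does not compute the Jameson-Wolf index itself.

```python
# Rows copied from the SEQ ID NO: 78 table: (positions, AI avg, length).
ROWS_SEQ_78 = [
    ("208-223", 1.06, 16),
    ("123-135", 1.05, 13),
    ("689-717", 1.03, 29),
    ("63-90", 1.02, 28),
    ("653-683", 1.01, 31),
    ("366-377", 1.00, 12),
]


def span_length(positions: str) -> int:
    """Length of an inclusive residue range written as 'start-end'."""
    start, end = (int(part) for part in positions.split("-"))
    return end - start + 1


def lengths_consistent(rows) -> bool:
    """True if every tabulated length equals end - start + 1."""
    return all(span_length(pos) == length for pos, _, length in rows)


def top_region(rows) -> str:
    """Positions of the region with the highest average index."""
    return max(rows, key=lambda row: row[1])[0]
```

The same check can be applied to any other table above by
substituting its rows.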



LSG binding molecules and assays 

This invention also provides a method for
identification of molecules, such as receptor molecules,
that bind LSGs. Genes encoding proteins that bind LSGs,
such as receptor proteins, can be identified by numerous
methods known to those of skill in the art. Examples
include, but are not limited to, ligand panning and FACS
sorting. Such methods are described in many laboratory
manuals such as, for instance, Coligan et al., Current
Protocols in Immunology 1(2): Chapter 5 (1991).

Expression cloning may also be employed for this
purpose. To this end, polyadenylated RNA is prepared from
a cell responsive to a LSG of the present invention. A cDNA
library is created from this RNA and the library is divided
into pools. The pools are then transfected individually
into cells that are not responsive to a LSG of the present
invention. The transfected cells then are exposed to
labeled LSG. LSG polypeptides can be labeled by a variety
of well-known techniques including, but not limited to,
standard methods of radio-iodination or inclusion of a
recognition site for a site-specific protein kinase.
Following exposure, the cells are fixed and binding of
labeled LSG is determined. These procedures conveniently
are carried out on glass slides. Pools containing labeled
LSG are identified as containing cDNA that produced
LSG-binding cells. Sub-pools are then prepared from these
positives, transfected into host cells and screened as
described above. Using an iterative sub-pooling and
re-screening process, one or more single clones that encode
the putative binding molecule, such as a receptor molecule,
can be isolated.
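
The iterative sub-pooling and re-screening logic described
above can be sketched in Python as follows. This is only an
illustration of the search strategy: the binding assay is
represented by a hypothetical callable, pool_is_positive,
and the number of sub-pools per round (n_pools) is an
assumed parameter, not a value taken from this disclosure.

```python
from typing import Callable, List, Sequence


def find_positive_clones(
    clones: Sequence[int],
    pool_is_positive: Callable[[List[int]], bool],
    n_pools: int = 10,
) -> List[int]:
    """Split positive pools into sub-pools until single clones remain.

    pool_is_positive stands in for transfecting a pool, exposing the
    cells to labeled LSG and scoring binding (hypothetical callable).
    """
    if n_pools < 2:
        raise ValueError("n_pools must be at least 2")
    positives: List[int] = []
    pending = [list(clones)]
    while pending:
        pool = pending.pop()
        if not pool_is_positive(pool):
            continue  # negative pools are discarded
        if len(pool) == 1:
            positives.append(pool[0])  # a single positive clone
            continue
        size = -(-len(pool) // n_pools)  # ceiling division
        pending.extend(pool[i:i + size] for i in range(0, len(pool), size))
    return sorted(positives)
```

With 1000 clones and ten sub-pools per round, a single
binder is isolated after three rounds of re-screening in
this idealized model, which ignores assay noise.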

Alternatively, a labeled ligand can be photoaffinity
linked to a cell extract, such as a membrane or a membrane
extract, prepared from cells that express a molecule that
it binds, such as a receptor molecule. Cross-linked
material is resolved by polyacrylamide gel electrophoresis
("PAGE") and exposed to X-ray film. The labeled complex
containing the ligand-receptor can be excised, resolved
into peptide fragments, and subjected to protein
microsequencing. The amino acid sequence obtained from
microsequencing can be used to design unique or degenerate
oligonucleotide probes to screen cDNA libraries to identify
genes encoding the putative receptor molecule.

Polypeptides of the invention also can be used to
assess LSG binding capacity of LSG binding molecules, such
as receptor molecules, in cells or in cell-free
preparations.
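
As an illustration of the degenerate probe design mentioned
above in connection with microsequencing, the following
Python sketch counts how many distinct nucleotide sequences
could encode a peptide under the standard genetic code and
finds the stretch of a peptide with the lowest degeneracy.
The peptide in the usage note is hypothetical, and the
helper names are not part of this disclosure.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code.
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2,
    "G": 4, "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2,
    "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Number of distinct coding sequences for the peptide."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


def least_degenerate_window(peptide: str, window: int):
    """Return (start index, segment, degeneracy) of the best window."""
    if not 1 <= window <= len(peptide):
        raise ValueError("window must fit inside the peptide")
    starts = range(len(peptide) - window + 1)
    best = min(starts, key=lambda i: degeneracy(peptide[i:i + window]))
    segment = peptide[best:best + window]
    return best, segment, degeneracy(segment)
```

For the hypothetical peptide LRSMWKDL, the three-residue
window MWK has a degeneracy of 2, whereas LRS has a
degeneracy of 216, so a probe based on MWK would need far
fewer oligonucleotide species.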
Agonists and antagonists - assays and molecules

The invention also provides a method of screening
compounds to identify those which enhance or block the
action of a LSG on cells. By "compound", as used herein,
it is meant to be inclusive of small organic molecules,
peptides, polypeptides and antibodies as well as any other
candidate molecules which have the potential to enhance or
agonize or block or antagonize the action of LSG on cells.
As used herein, an agonist is a compound which increases
the natural biological functions of a LSG or which
functions in a manner similar to a LSG, while an
antagonist, as used herein, is a compound which decreases
or eliminates such functions. Various known methods for
screening for agonists and/or antagonists can be adapted
for use in identifying LSG agonists or antagonists.

For example, a cellular compartment, such as a
membrane or a preparation thereof, such as a membrane
preparation, may be prepared from a cell that expresses a
molecule that binds a LSG, such as a molecule of a
signaling or regulatory pathway modulated by LSG. The
preparation is incubated with labeled LSG in the absence or
the presence of a compound which may be a LSG agonist or
antagonist. The ability of the compound to bind the
binding molecule is reflected in decreased binding of the
labeled ligand. Compounds which bind gratuitously, i.e.,
without inducing the effects of a LSG upon binding to the
LSG binding molecule, are most likely to be good
antagonists. Compounds that bind well and elicit effects
that are the same as or closely related to LSG are
agonists. LSG-like effects of potential agonists and
antagonists may be measured, for instance, by determining
activity of a second messenger system following interaction
of the candidate molecule with a cell or appropriate cell
preparation, and comparing the effect with that of LSG or
molecules that elicit the same effects as LSG. Second
messenger systems that may be useful in this regard
include, but are not limited to, AMP guanylate cyclase, ion
channel or phosphoinositide hydrolysis second messenger
systems.

Another example of an assay for LSG antagonists is a
competitive assay that combines LSG and a potential
antagonist with membrane-bound LSG receptor molecules or
recombinant LSG receptor molecules under appropriate
conditions for a competitive inhibition assay. LSG can be
labeled, such as by radioactivity, such that the number of
LSG molecules bound to a receptor molecule can be
determined accurately to assess the effectiveness of the
potential antagonist.
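
One common way to summarize such a competitive assay is to
convert the amount of labeled LSG bound at each antagonist
concentration into percent inhibition of specific binding
and to estimate the concentration giving 50% inhibition.
The Python sketch below assumes that total binding (no
competitor) and nonspecific binding have been measured; all
counts and concentrations in the usage note are invented for
illustration and are not data from this disclosure.

```python
import math


def percent_inhibition(bound: float, total: float, nonspecific: float) -> float:
    """Percent inhibition of specific binding of the labeled ligand."""
    specific_max = total - nonspecific
    if specific_max <= 0:
        raise ValueError("total binding must exceed nonspecific binding")
    return 100.0 * (1.0 - (bound - nonspecific) / specific_max)


def ic50_log_interpolated(concentrations, inhibitions):
    """Estimate the 50% point by linear interpolation on log10(conc).

    Returns None if the data do not bracket 50% inhibition.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi != i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None
```

For example, with 10,000 counts of total binding, 1,000
counts of nonspecific binding and 5,500 counts bound in the
presence of the candidate, the candidate inhibits 50% of
specific binding.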

Potential antagonists include small organic
molecules, peptides, polypeptides and antibodies that bind
to a LSG polypeptide of the invention and thereby inhibit
or extinguish its activity. Potential antagonists also may
be small organic molecules, a peptide, a polypeptide such
as a closely related protein or antibody that binds the
same sites on a binding molecule, such as a receptor
molecule, without inducing LSG-induced activities, thereby
preventing the action of LSG by excluding LSG from binding.

Potential antagonists include small molecules which
bind to and occupy the binding site of the LSG polypeptide
thereby preventing binding to cellular binding molecules,
such as receptor molecules, such that normal biological
activity is prevented. Examples of small molecules include
but are not limited to small organic molecules, peptides or
peptide-like molecules.

Other potential antagonists include antisense
molecules. Antisense technology can be used to control gene
expression through antisense DNA or RNA or through
triple-helix formation. Antisense techniques are discussed,
for example, in Okano, J. Neurochem. 56: 560 (1991);
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE
EXPRESSION, CRC Press, Boca Raton, Fla. (1988). Triple
helix formation is discussed in, for instance, Lee et al.,
Nucleic Acids Research 6: 3073 (1979); Cooney et al.,
Science 241: 456 (1988); and Dervan et al., Science 251:
1360 (1991). The methods are based on binding of a
polynucleotide to a complementary DNA or RNA. For example,
the 5' coding portion of a polynucleotide that encodes a
mature LSG polypeptide of the present invention may be used
to design an antisense RNA oligonucleotide of from about 10
to 40 base pairs in length. A DNA oligonucleotide is
designed to be complementary to a region of the gene
involved in transcription thereby preventing transcription
and the production of a LSG polypeptide. The antisense RNA
oligonucleotide hybridizes to the mRNA in vivo and blocks
translation of the mRNA molecule into a LSG polypeptide.
The oligonucleotides described above can also be delivered
to cells such that the antisense RNA or DNA may be
expressed in vivo to inhibit production of a LSG.
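
The design of an antisense RNA oligonucleotide against the
5' coding portion of a message, as described above, amounts
to taking the reverse complement of the target region. A
minimal Python sketch follows; the input sequence in the
usage note is hypothetical and the default length of 20 is
an assumption chosen from within the stated range of about
10 to 40.

```python
# Complement table for RNA bases.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(coding_5prime: str, length: int = 20) -> str:
    """Antisense RNA (written 5' to 3') against the first `length` bases.

    Accepts DNA or RNA input; T is read as U.
    """
    if not 10 <= length <= 40:
        raise ValueError("length should be from about 10 to 40 bases")
    target = coding_5prime.upper().replace("T", "U")[:length]
    if len(target) < length:
        raise ValueError("coding sequence is shorter than requested length")
    if set(target) - set("ACGU"):
        raise ValueError("sequence contains non-nucleotide characters")
    # Complement each base, then reverse to write the oligo 5' to 3'.
    return target.translate(RNA_COMPLEMENT)[::-1]
```

For the hypothetical message start AUGGCCAAGCUU and a
length of 10, the function returns GCUUGGCCAU.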
Compositions

The present invention also relates to compositions
comprising a LSG polynucleotide or a LSG polypeptide or an
agonist or antagonist thereof.

For example, a LSG polynucleotide, polypeptide or an
agonist or antagonist thereof of the present invention may
be employed in combination with a non-sterile or sterile
carrier or carriers for use with cells, tissues or
organisms, such as a pharmaceutical carrier suitable for
administration to a subject. Such compositions comprise,
for instance, a media additive or a therapeutically
effective amount of a polypeptide of the invention and a
pharmaceutically acceptable carrier or excipient. Such




carriers may include, but are not limited to, saline, 
buffered saline, dextrose, water, glycerol, ethanol and 
combinations thereof. The formulation should suit the mode 
of administration. 
Compositions of the present invention will be
formulated and dosed in a fashion consistent with good
medical practice, taking into account the clinical
condition of the individual patient (especially the side
effects of treatment with the polypeptide or other compound
alone), the site of delivery, the method of administration,
the scheduling of administration, and other factors known
to practitioners. The "effective amount" for purposes
herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically
effective amount of secreted polypeptide administered
parenterally per dose will be in the range of about 1
µg/kg/day to 10 mg/kg/day of patient body weight, although,
as noted above, this will be subject to therapeutic
discretion. More preferably, this dose is at least 0.01
mg/kg/day, and most preferably for humans between about
0.01 and 1 mg/kg/day for the hormone. If given
continuously, the polypeptide or other compound is
typically administered at a dose rate of about 1 µg/kg/hour
to about 50 mg/kg/hour, either by 1-4 injections per day or
by continuous subcutaneous infusion, for example, using a
mini-pump. An intravenous bag solution may also be
employed. The length of treatment needed to observe changes
and the interval following treatment for responses to occur
appear to vary depending on the desired effect.
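
As a worked illustration of the per-weight ranges stated
above, read as about 1 µg/kg/day to 10 mg/kg/day overall and
about 0.01 to 1 mg/kg/day as most preferred for humans, the
following Python sketch converts them into total daily
amounts for a given body weight. The 70 kg weight in the
usage note is an assumed example, and the sketch is
arithmetic only, not dosing guidance.

```python
# Ranges from the text, expressed in micrograms per kg per day.
GENERAL_RANGE_UG_PER_KG_DAY = (1, 10_000)      # 1 µg/kg/day to 10 mg/kg/day
PREFERRED_RANGE_UG_PER_KG_DAY = (10, 1_000)    # 0.01 to 1 mg/kg/day


def daily_dose_range_ug(weight_kg, per_kg_range=GENERAL_RANGE_UG_PER_KG_DAY):
    """Total daily amount (low, high) in micrograms for a body weight."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low, high = per_kg_range
    return (weight_kg * low, weight_kg * high)
```

For an assumed 70 kg patient the general range corresponds
to 70 µg to 700 mg per day, and the most preferred range to
0.7 mg to 70 mg per day.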

Pharmaceutical compositions containing the secreted
protein of the invention are administered orally, rectally,
parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments,
gels, drops or transdermal patch), buccally, or as an oral
or nasal spray. "Pharmaceutically acceptable carrier"
refers to a non-toxic solid, semisolid or liquid filler,
diluent, encapsulating material or formulation auxiliary of
any type. The term "parenteral" as used herein refers to
modes of administration which include intravenous,
intramuscular, intraperitoneal, intrasternal, subcutaneous
and intraarticular injection and infusion.

The polypeptide or other compound is also suitably
administered by sustained-release systems. Suitable
examples of sustained-release compositions include
semipermeable polymer matrices in the form of shaped
articles, e.g., films, or microcapsules. Sustained-release
matrices include polylactides (U.S. Patent 3,773,919 and EP
58481), copolymers of L-glutamic acid and
gamma-ethyl-L-glutamate (Sidman, U. et al., Biopolymers 22:
547-556 (1983)), poly(2-hydroxyethyl methacrylate) (R.
Langer et al., J. Biomed. Mater. Res. 15: 167-277 (1981),
and R. Langer, Chem. Tech. 12: 98-105 (1982)), ethylene
vinyl acetate (R. Langer et al.) and
poly-D-(-)-3-hydroxybutyric acid (EP 133,988).
Sustained-release compositions also include liposomally
entrapped polypeptides. Liposomes containing the
polypeptide or other compound are prepared by well known
methods (Epstein et al., Proc. Natl. Acad. Sci. USA 82:
3688-3692 (1985); Hwang et al., Proc. Natl. Acad. Sci. USA
77: 4030-4034 (1980); EP 52322; EP 36676; EP 88046; EP
143949; EP 142641; Japanese Pat. Appl. 83-118008; U.S.
Patents 4,485,045 and 4,544,545; and EP 102324).
Ordinarily, the liposomes are of the small (about 200-800
Angstroms) unilamellar type in which the lipid content is
greater than about 30 mol. percent cholesterol, the
selected proportion being adjusted for the optimal therapy.

For parenteral administration, in one embodiment, the
polypeptide or other compound is formulated generally by
mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a
pharmaceutically acceptable carrier, i.e., one that is
non-toxic to recipients at the dosages and concentrations
employed and is compatible with other ingredients of the
formulation.

For example, the formulation preferably does not
include oxidizing agents and other compounds that are known
to be deleterious to the polypeptide or other compound.

Generally, the formulations are prepared by
contacting the polypeptide or other compound uniformly and
intimately with liquid carriers or finely divided solid
carriers or both. Then, if necessary, the product is shaped
into the desired formulation. Preferably the carrier is a
parenteral carrier, more preferably a solution that is
isotonic with the blood of the recipient. Examples of such
carrier vehicles include water, saline, Ringer's solution,
and dextrose solution. Non-aqueous vehicles such as fixed
oils and ethyl oleate are also useful herein, as well as
liposomes.

The carrier suitably contains minor amounts of
additives such as substances that enhance isotonicity and
chemical stability. Such materials are non-toxic to
recipients at the dosages and concentrations employed, and
include buffers such as phosphate, citrate, succinate,
acetic acid, and other organic acids or their salts;
antioxidants such as ascorbic acid; low molecular weight
(less than about ten residues) polypeptides, e.g.,
polyarginine or tripeptides; proteins, such as serum
albumin, gelatin, or immunoglobulins; hydrophilic polymers
such as polyvinylpyrrolidone; amino acids, such as glycine,
glutamic acid, aspartic acid, or arginine; monosaccharides,
disaccharides, and other carbohydrates including cellulose
or its derivatives, glucose, mannose, or dextrins;
chelating agents such as EDTA; sugar alcohols such as
mannitol or sorbitol; counterions such as sodium; and/or
nonionic surfactants such as polysorbates, poloxamers, or
PEG.

The polypeptide or other compound is typically
formulated in such vehicles at a concentration of about 0.1
mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about
3 to 8. It will be understood that the use of certain of
the foregoing excipients, carriers, or stabilizers will
result in the formation of polypeptide salts or salts of
the other compounds.

Any polypeptide to be used for therapeutic
administration should be sterile. Sterility is readily
accomplished by filtration through sterile filtration
membranes (e.g., 0.2 micron membranes). Therapeutic
polypeptide compositions generally are placed into a
container having a sterile access port, for example, an
intravenous solution bag or vial having a stopper
pierceable by a hypodermic injection needle.

Polypeptides ordinarily will be stored in unit or
multi-dose containers, for example, sealed ampules or
vials, as an aqueous solution or as a lyophilized
formulation for reconstitution. As an example of a
lyophilized formulation, 10-ml vials are filled with 5 ml
of sterile-filtered 1% (w/v) aqueous polypeptide solution,
and the resulting mixture is lyophilized. The infusion
solution is prepared by reconstituting the lyophilized
polypeptide using bacteriostatic Water-for-Injection.
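
The quantities in the lyophilized formulation example above
can be checked with simple arithmetic: a 1% (w/v) solution
contains 1 g per 100 ml, i.e., 10 mg/ml, so a 5 ml fill
carries 50 mg of polypeptide per vial. The Python sketch
below performs this calculation; the reconstitution volume
in the usage note is an assumed example, as the text does
not specify one.

```python
def polypeptide_mass_mg(fill_volume_ml: float, percent_w_v: float) -> float:
    """Mass in mg in a fill volume of a percent (w/v) solution.

    1% (w/v) = 1 g per 100 ml = 10 mg per ml.
    """
    return fill_volume_ml * percent_w_v * 10.0


def reconstituted_conc_mg_per_ml(mass_mg: float, diluent_ml: float) -> float:
    """Concentration after reconstituting the lyophilized cake."""
    if diluent_ml <= 0:
        raise ValueError("diluent volume must be positive")
    return mass_mg / diluent_ml
```

Reconstituting the 50 mg vial in an assumed 5 ml of diluent
gives 10 mg/ml, which lies within the 0.1 mg/ml to 100 mg/ml
concentration range mentioned earlier in this section.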

Kits

The invention further relates to pharmaceutical packs
and kits comprising one or more containers filled with one
or more of the ingredients of the aforementioned
compositions of the invention. Associated with such
container(s) can be a notice in the form prescribed by a
governmental agency regulating the manufacture, use or sale
of pharmaceuticals or biological products, reflecting
approval by the agency of the manufacture, use or sale of
the product for human administration.

Administration

LSG polypeptides or polynucleotides or other
compounds, preferably agonists or antagonists thereof of
the present invention may be employed alone or in
conjunction with other compounds, such as therapeutic
compounds.

The pharmaceutical compositions may be administered
in any effective, convenient manner including, for
instance, administration by topical, oral, anal, vaginal,
intravenous, intraperitoneal, intramuscular, subcutaneous,
intranasal or intradermal routes among others.

The pharmaceutical compositions generally are
administered in an amount effective for treatment or
prophylaxis of a specific indication or indications. In
general, the compositions are administered in an amount of
at least about 10 µg/kg body weight. However, it will be
appreciated that optimum dosage will be determined by
standard methods for each treatment modality and
indication, taking into account the indication, its
severity, route of administration, complicating conditions
and the like.

It will be appreciated that conditions caused by a
decrease in the standard or normal expression level of a
LSG polypeptide in an individual can be treated by
administering the LSG polypeptide of the present invention,
preferably in the secreted form, or an agonist thereof.
Thus, the invention also provides a method of treatment of
an individual in need of an increased level of a LSG
polypeptide comprising administering to such an individual
a pharmaceutical composition comprising an amount of the
LSG polypeptide or an agonist thereof to increase the
activity level of the LSG polypeptide in such an
individual. For example, a patient with decreased levels
of a LSG polypeptide may receive a daily dose of 0.1-100
µg/kg of a LSG polypeptide or agonist thereof for six
consecutive days. Preferably, if a LSG polypeptide is
administered it is in the secreted form.

Compositions of the present invention can also be
administered to treat increased levels of a LSG
polypeptide. For example, antisense technology can be used
to inhibit production of a LSG polypeptide of the present
invention. This technology is one example of a method of
decreasing levels of a polypeptide, preferably a secreted
form, due to a variety of etiologies, such as cancer. A
patient diagnosed with abnormally increased levels of a
polypeptide can be administered intravenously antisense
polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0 mg/kg/day for
21 days. This treatment is preferably repeated after a
7-day rest period if the treatment was well tolerated.
Compositions comprising an antagonist of a LSG polypeptide
can also be administered to decrease levels of LSG in a
patient.
Gene therapy 

The LSG polynucleotides, polypeptides, agonists and
antagonists that are polypeptides may be employed in
accordance with the present invention by expression of such
polypeptides in vivo, in treatment modalities often
referred to as "gene therapy."

Thus, for example, cells from a patient may be
engineered with a polynucleotide, such as a DNA or RNA,
encoding a polypeptide ex vivo, and the engineered cells
then can be provided to a patient to be treated with the
polypeptide. For example, cells may be engineered ex vivo
by the use of a retroviral plasmid vector containing RNA
encoding a polypeptide of the present invention. Such
methods are well-known in the art and their use in the
present invention will be apparent from the teachings
herein.

Similarly, cells may be engineered in vivo for
expression of a polypeptide in vivo by procedures known in
the art. For example, a polynucleotide of the invention
may be engineered for expression in a replication defective
retroviral vector, as discussed supra. The retroviral
expression construct then may be isolated and introduced
into a packaging cell transduced with a retroviral plasmid
vector containing RNA encoding a polypeptide of the present
invention such that the packaging cell now produces
infectious viral particles containing the gene of interest.
These producer cells may be administered to a patient for
engineering cells in vivo and expression of the polypeptide
in vivo. These and other methods for administering a
polypeptide of the present invention would be apparent to
those skilled in the art upon reading the instant
application.

Retroviruses from which the retroviral plasmid
vectors herein above mentioned may be derived include, but
are not limited to, Moloney Murine Leukemia Virus, spleen
necrosis virus, retroviruses such as Rous Sarcoma Virus,
Harvey Sarcoma Virus, avian leukosis virus, gibbon ape
leukemia virus, human immunodeficiency virus, adenovirus,
Myeloproliferative Sarcoma Virus, and mammary tumor virus.
In one embodiment, the retroviral plasmid vector is derived
from Moloney Murine Leukemia Virus.

Such vectors will include one or more promoters for
expressing the polypeptide. The selection of a suitable
promoter will be apparent to those skilled in the art from
the teachings contained herein. However, examples of
suitable promoters which may be employed include, but are
not limited to, the retroviral LTR, the SV40 promoter, the
human cytomegalovirus (CMV) promoter described in Miller et
al., Biotechniques 7: 980-990 (1989), and eukaryotic
cellular promoters such as the histone, RNA polymerase III,
and beta-actin promoters. Other viral promoters which may
be employed include, but are not limited to, adenovirus
promoters, thymidine kinase (TK) promoters, and B19
parvovirus promoters. Other promoters which may be
used include the respiratory syncytial virus (RSV) promoter,
inducible promoters such as the MMT promoter, the
metallothionein promoter, heat shock promoters, the albumin
promoter, the ApoAI promoter, human globin promoters, viral
thymidine kinase promoters such as the Herpes Simplex
thymidine kinase promoter, retroviral LTRs, the beta-actin
promoter, and human growth hormone promoters. The promoter
also may be the native promoter which controls the gene
encoding the polypeptide.

The nucleic acid sequence encoding the polypeptide of 
the present invention will be placed under the control of a 
suitable promoter. 

In one embodiment, the retroviral plasmid vector is
employed to transduce packaging cell lines to form producer
cell lines. Examples of packaging cells which may be
transfected include, but are not limited to, the PE501,
PA317, ψ-2, ψ-AM, PA12, T19-14X, VT-19-17-H2, ψCRE, ψCRIP,
GP+E-86, GP+envAm12, and DAN cell lines as described in
Miller, A., Human Gene Therapy 1: 5-14 (1990). The vector
may be transduced into the packaging cells through any
means known in the art. Such means include, but are not
limited to, electroporation, the use of liposomes, and CaPO4
precipitation. Alternatively, the retroviral plasmid
vector may be encapsulated into a liposome, or coupled to a
lipid, and then administered to a host. The producer cell
line will generate infectious retroviral vector particles
which are inclusive of the nucleic acid sequence(s)
encoding the polypeptides. Such retroviral vector
particles then may be employed to transduce eukaryotic
cells, either in vitro or in vivo. The transduced
eukaryotic cells will express the nucleic acid sequence(s)
encoding the polypeptide. Eukaryotic cells which may be
transduced include, but are not limited to, embryonic stem
cells, embryonic carcinoma cells, as well as hematopoietic
stem cells, hepatocytes, fibroblasts, myoblasts,
keratinocytes, endothelial cells, and bronchial epithelial
cells.

An exemplary method of gene therapy involves
transplantation of fibroblasts which are capable of
expressing a LSG polypeptide or an agonist or antagonist
thereof onto a patient. Generally fibroblasts are obtained
from a subject by skin biopsy. The resulting tissue is
placed in tissue-culture medium and separated into small
pieces. Small chunks of the tissue are placed on a wet
surface of a tissue culture flask; approximately ten pieces
are placed in each flask. The flask is turned upside down,
closed tight and left at room temperature overnight.
After 24 hours at room temperature, the flask is inverted
and the chunks of tissue remain fixed to the bottom of the
flask and fresh media (e.g., Ham's F12 media, with 10%
FBS, penicillin and streptomycin) is added. The flasks are
then incubated at 37°C for approximately one week. At this
time, fresh media is added and subsequently changed every
several days. After an additional two weeks in culture, a
monolayer of fibroblasts emerges. The monolayer is
trypsinized and scaled into larger flasks.

pMV-7 (Kirschmeier, P. T. et al., DNA, 7: 219-25 (1988)),
flanked by the long terminal repeats of the Moloney murine
sarcoma virus, is digested with EcoRI and HindIII and
subsequently treated with calf intestinal phosphatase. The
linear vector is fractionated on agarose gel and purified,
using glass beads. The cDNA encoding a LSG polypeptide of
the present invention or an agonist or antagonist thereof
can be amplified using PCR primers which correspond to its
5' and 3' end sequences respectively. Preferably, the 5'
primer contains an EcoRI site and the 3' primer includes a
HindIII site. Equal quantities of the Moloney murine
sarcoma virus linear backbone and the amplified EcoRI and
HindIII fragment are added together in the presence of T4
DNA ligase. The resulting mixture is maintained under
conditions appropriate for ligation of the two fragments.
The ligation mixture is then used to transform bacteria
HB101, which are then plated onto agar containing kanamycin
for the purpose of confirming that the vector has the gene
of interest properly inserted.

Amphotropic pA317 or GP+am12 packaging cells are grown in
tissue culture to confluent density in Dulbecco's Modified
Eagle's Medium (DMEM) with 10% calf serum (CS), penicillin
and streptomycin. The MSV vector containing the gene is
then added to the media and the packaging cells transduced
with the vector. The packaging cells now produce infectious
viral particles containing the gene (the packaging cells
are now referred to as producer cells). Fresh media is
added to the transduced producer cells, and subsequently,
the media is harvested from a 10 cm plate of confluent
producer cells. The spent media, containing the infectious
viral particles, is filtered through a millipore filter to
remove detached producer cells and this media is then used
to infect fibroblast cells.

Media is removed from a sub-confluent plate of fibroblasts
and quickly replaced with the media from the producer
cells. This media is removed and replaced with fresh media.
If the titer of virus is high, then virtually all
fibroblasts will be infected and no selection is required.
If the titer is very low, then it is necessary to use a
retroviral vector that has a selectable marker, such as neo
or his. Once the fibroblasts have been efficiently
infected, the fibroblasts are analyzed to determine whether
protein is produced. The engineered fibroblasts are then
transplanted onto the host, either alone or after having
been grown to confluence on cytodex 3 microcarrier beads.
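
The primer design step of the exemplary method, in which
the 5' primer carries an EcoRI site and the 3' primer a
HindIII site, can be sketched in Python as follows. The
recognition sequences GAATTC (EcoRI) and AAGCTT (HindIII)
are standard; the annealing length, the check for internal
sites and the cDNA in the usage note are assumptions made
for illustration and are not taken from this disclosure.

```python
ECORI_SITE = "GAATTC"
HINDIII_SITE = "AAGCTT"
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(DNA_COMPLEMENT)[::-1]


def design_cloning_primers(cdna: str, anneal_len: int = 18):
    """Return (forward, reverse) primers carrying EcoRI / HindIII sites.

    Raises ValueError if the cDNA contains either site internally,
    since digestion would then cut within the insert.
    """
    cdna = cdna.upper()
    if len(cdna) < anneal_len:
        raise ValueError("cDNA is shorter than the annealing length")
    for name, site in (("EcoRI", ECORI_SITE), ("HindIII", HINDIII_SITE)):
        if site in cdna:
            raise ValueError(f"cDNA contains an internal {name} site")
    forward = ECORI_SITE + cdna[:anneal_len]
    reverse = HINDIII_SITE + reverse_complement(cdna[-anneal_len:])
    return forward, reverse
```

In practice a few extra bases are often added 5' of each
restriction site to aid digestion; they are omitted here to
keep the sketch minimal.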

Alternatively, in vivo gene therapy methods can be
used to treat LSG related disorders, diseases and
conditions. Gene therapy methods relate to the
introduction of naked nucleic acid (DNA, RNA, and antisense
DNA or RNA) sequences into an animal to increase or
decrease the expression of the polypeptide.

For example, a LSG polynucleotide of the present
invention or a nucleic acid sequence encoding an agonist or
antagonist thereto may be operatively linked to a promoter
or any other genetic elements necessary for the expression
of the polypeptide by the target tissue. Such gene therapy
and delivery techniques and methods are known in the art,
see, for example, WO 90/11092, WO 98/11779; U.S. Patents
5,693,622, 5,705,151, and 5,580,859; Tabata H. et al.
(1997) Cardiovasc. Res. 35 (3): 470-479, Chao J. et al.
(1997) Pharmacol. Res. 35 (6): 517-522, Wolff J. A. (1997)
Neuromuscul. Disord. 7 (5): 314-318, Schwartz B. et al.
(1996) Gene Ther. 3 (5): 405-411, Tsurumi Y. et al. (1996)
Circulation 94 (12): 3281-3290 (incorporated herein by
reference). The polynucleotide constructs may be delivered
by any method that delivers injectable materials to the
cells of an animal, such as injection into the
interstitial space of tissues (heart, muscle, skin, lung,
liver, intestine and the like). The polynucleotide
constructs can be delivered in a pharmaceutically
acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers
to sequences that are free from any delivery vehicle that
acts to assist, promote, or facilitate entry into the cell,
including viral sequences, viral particles, liposome
formulations, lipofectin or precipitating agents and the
like. However, polynucleotides may also be delivered in
liposome formulations (such as those taught in Felgner P.
L. et al. (1995) Ann. NY Acad. Sci. 772: 126-139 and
Abdallah B. et al. (1995) Biol. Cell 85 (1): 1-7) which can
be prepared by methods well known to those skilled in the
art.

The polynucleotide vector constructs used in the gene
therapy method are preferably constructs that will not
integrate into the host genome nor will they contain
sequences that allow for replication. Any strong promoter
known to those skilled in the art can be used for driving
the expression of DNA. Unlike other gene therapy
techniques, one major advantage of introducing naked
nucleic acid sequences into target cells is the transitory
nature of the polynucleotide synthesis in the cells.
Studies have shown that non-replicating DNA sequences can
be introduced into cells to provide production of the
desired polypeptide for periods of up to six months.

The polynucleotide construct can be delivered to the
interstitial space of tissues within an animal, including
those of muscle, skin, brain, lung, liver, spleen, bone
marrow, thymus, heart, lymph, blood, bone, cartilage,
pancreas, kidney, gall bladder, stomach, intestine, testis,
ovary, uterus, rectum, nervous system, eye, gland, and
connective tissue. Interstitial space of the tissues
comprises the intercellular fluid, mucopolysaccharide
matrix among the reticular fibers of organ tissues, elastic
fibers in the walls of vessels or chambers, collagen fibers
of fibrous tissues, or that same matrix within connective
tissue ensheathing muscle cells or in the lacunae of bone.
It is similarly the space occupied by the plasma of the
circulation and the lymph fluid of the lymphatic channels.
Delivery to the interstitial space of muscle tissue is
preferred. The polynucleotide construct may be
conveniently delivered by injection into the tissues
comprising these cells. The constructs are preferably
delivered to and expressed in persistent, non-dividing
cells which are differentiated, although delivery and
expression may be achieved in non-differentiated or less
completely differentiated cells, such as, for example, stem
cells of blood or skin fibroblasts. In vivo muscle cells
are particularly competent in their ability to take up and
express polynucleotides.

For the naked polynucleotide injection, an effective
dosage amount of DNA or RNA will be in the range of from
about 0.05 µg/kg body weight to about 50 mg/kg body weight.
Preferably the dosage will be from about 0.005 mg/kg to
about 20 mg/kg and more preferably from about 0.05 mg/kg to
about 5 mg/kg. Of course, as the artisan of ordinary skill
will appreciate, this dosage will vary according to the
tissue site of injection. The appropriate and effective
dosage of nucleic acid sequence can readily be determined
by those of ordinary skill in the art and may depend on the
condition being treated and the route of administration.
The preferred route of administration is by the parenteral
route of injection into the interstitial space of tissues.
However, other parenteral routes may also be used, such as
inhalation of an aerosol formulation, particularly for
delivery to lungs or bronchial tissues, throat or mucous
membranes of the nose. In addition, naked polynucleotide
constructs can be delivered to arteries during angioplasty
by the catheter used in the procedure.
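
The per-kilogram ranges above translate into a per-animal
amount by simple multiplication. The following is a minimal
sketch of that arithmetic only, not a dosing tool; the
function name, the tier labels and the 25 g body weight are
illustrative assumptions that do not appear in the
specification.

```python
# Dose ranges quoted in the text, in mg of nucleic acid per kg body weight.
# Only the two tiers stated in mg/kg are included here.
DOSE_RANGES_MG_PER_KG = {
    "preferred": (0.005, 20.0),
    "more_preferred": (0.05, 5.0),
}


def dose_window_mg(body_weight_kg, tier="more_preferred"):
    """Return (low, high) amounts in mg for an animal of the given weight."""
    low, high = DOSE_RANGES_MG_PER_KG[tier]
    return low * body_weight_kg, high * body_weight_kg


# Hypothetical example: a 25 g (0.025 kg) mouse, "more preferred" tier.
low_mg, high_mg = dose_window_mg(0.025)
print(f"{low_mg * 1000:.2f} to {high_mg * 1000:.1f} micrograms per animal")
```

Keeping the tiers in a single table makes it easy to compare
the window for different body weights without repeating the
unit conversion.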

The dose response effects of injected polynucleotide
in muscle in vivo are determined as follows. Suitable
template DNA for production of mRNA coding for polypeptide
of the present invention is prepared in accordance with a
standard recombinant DNA methodology. The template DNA,
which may be either circular or linear, is either used as
naked DNA or complexed with liposomes. The quadriceps
muscles of mice are then injected with various amounts of
the template DNA.

Five to six week old female and male Balb/C mice are
anesthetized by intraperitoneal injection with 0.3 ml of
2.5% Avertin. A 1.5 cm incision is made on the anterior
thigh, and the quadriceps muscle is directly visualized.
The template DNA is injected in 0.1 ml of carrier in a 1 cc
syringe through a 27 gauge needle over one minute,
approximately 0.5 cm from the distal insertion site of the
muscle into the knee and about 0.2 cm deep. A suture is
placed over the injection site for future localization, and
the skin is closed with stainless steel clips.

After an appropriate incubation time (e.g., 7 days),
muscle extracts are prepared by excising the entire
quadriceps. Every fifth 15 µm cross-section of the
individual quadriceps muscles is histochemically stained
for protein expression. A time course for protein
expression may be done in a similar fashion except that
quadriceps from different mice are harvested at different
times. Persistence of DNA in muscle following injection
may be determined by Southern blot analysis after preparing
total cellular DNA and HIRT supernatants from injected and
control mice.

The results of the above experimentation in mice can
be used to extrapolate proper dosages and other treatment
parameters in humans and other animals using naked DNA.

Nonhuman Transgenic Animals

The LSG polypeptides of the invention can also be
expressed in nonhuman transgenic animals. Nonhuman animals
of any species, including, but not limited to, mice, rats,
rabbits, hamsters, guinea pigs, pigs, micro-pigs, goats,
sheep, cows and non-human primates, e.g., baboons,
monkeys, and chimpanzees, may be used to generate
transgenic animals. Any technique known in the art may be
used to introduce the transgene (i.e., polynucleotides of
the invention) into animals to produce the founder lines of
transgenic animals. Such techniques include, but are not
limited to, pronuclear microinjection (Paterson et al.,
Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et
al., Biotechnology (NY) 11: 1263-1270 (1993); Wright et
al., Biotechnology (NY) 9: 830-834 (1991); and Hoppe et
al., U.S. Patent 4,873,191); retrovirus mediated gene
transfer into germ lines (Van der Putten et al., Proc.
Natl. Acad. Sci., USA 82: 6148-6152 (1985)), blastocysts or
embryos; gene targeting in embryonic stem cells (Thompson
et al., Cell 56: 313-321 (1989)); electroporation of cells
or embryos (Lo, Mol. Cell. Biol. 3: 1803-1814 (1983));
introduction of the polynucleotides of the invention using
a gene gun (see, e.g., Ulmer et al., Science 259: 1745
(1993)); introducing nucleic acid constructs into embryonic
pluripotent stem cells and transferring the stem cells back
into the blastocyst; and sperm mediated gene transfer
(Lavitrano et al., Cell 57: 717-723 (1989)). For a review
of such techniques, see Gordon, "Transgenic Animals," Intl.
Rev. Cytol. 115: 171-229 (1989), which is incorporated by
reference herein in its entirety.

Any technique known in the art may be used to produce
transgenic clones containing polynucleotides of the
invention, for example, nuclear transfer into enucleated
oocytes of nuclei from cultured embryonic, fetal, or adult
cells induced to quiescence (Campbell et al., Nature 380:
64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).

The present invention provides for transgenic animals
that carry the transgene in all their cells, as well as
animals which carry the transgene in some, but not all, of
their cells, i.e., mosaic or chimeric animals. The
transgene may be integrated as a single transgene or as
multiple copies such as in concatamers, e.g., head-to-head
tandems or head-to-tail tandems. The transgene may also be
selectively introduced into and activated in a particular
cell type by following, for example, the teaching of Lasko
et al. (Proc. Natl. Acad. Sci. USA 89: 6232-6236 (1992)).
The regulatory sequences required for such a cell-type
specific activation will depend upon the particular cell
type of interest, and will be apparent to those of skill in
the art. When it is desired that the




polynucleotide transgene be integrated into the chromosomal
site of the endogenous gene, gene targeting is preferred.
Briefly, when such a technique is to be utilized, vectors
containing some nucleotide sequences homologous to the
endogenous gene are designed for the purpose of
integrating, via homologous recombination with chromosomal
sequences, into and disrupting the function of the
nucleotide sequence of the endogenous gene. The transgene
may also be selectively introduced into a particular cell
type, thus inactivating the endogenous gene in only that
cell type, by following, for example, the teaching of Gu et
al. (Science 265: 103-106 (1994)). The regulatory
sequences required for such a cell-type specific
inactivation will depend upon the particular cell type of
interest, and will be apparent to those of skill in the
art.

Once transgenic animals have been generated, the
expression of the recombinant gene may be assayed utilizing
standard techniques. Initial screening may be accomplished
by Southern blot analysis or PCR techniques to analyze
animal tissues to verify that integration of the transgene
has taken place. The level of mRNA expression of the
transgene in the tissues of the transgenic animals may also
be assessed using techniques which include, but are not
limited to, Northern blot analysis of tissue samples
obtained from the animal, in situ hybridization analysis,
and reverse transcriptase-PCR (RT-PCR). Samples of
transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using
antibodies specific for the transgene product.

Once the founder animals are produced, they may be
bred, inbred, outbred, or crossbred to produce colonies of
the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of
founder animals with more than one integration site in
order to establish separate lines; inbreeding of separate
lines in order to produce compound transgenics that express
the transgene at higher levels because of the effects of
additive expression of each transgene; crossing of
heterozygous transgenic animals to produce animals
homozygous for a given integration site in order to both
augment expression and eliminate the need for screening of
animals by DNA analysis; crossing of separate homozygous
lines to produce compound heterozygous or homozygous lines;
and breeding to place the transgene on a distinct
background that is appropriate for an experimental model of
interest.
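
The heterozygote cross mentioned among the breeding
strategies above can be illustrated with a small
Punnett-square calculation. This is a sketch under stated
assumptions: a single integration site, simple Mendelian
segregation, and hypothetical allele labels ("T" for the
transgene, "w" for the unmodified locus); none of these
names come from the specification.

```python
from collections import Counter
from itertools import product


def cross(parent_a: str, parent_b: str) -> dict:
    """Expected genotype frequencies among offspring of two parents.

    Each parent is a two-character string of alleles at one integration
    site, e.g. "Tw" for an animal heterozygous for the transgene.
    """
    # Every offspring receives one allele from each parent; sorting the
    # pair makes "wT" and "Tw" count as the same genotype.
    offspring = Counter(
        "".join(sorted(pair)) for pair in product(parent_a, parent_b)
    )
    total = sum(offspring.values())
    return {genotype: count / total for genotype, count in offspring.items()}


# Hypothetical example: crossing two heterozygous transgenic animals.
freqs = cross("Tw", "Tw")
for genotype, freq in sorted(freqs.items()):
    print(genotype, freq)
```

Under these assumptions one quarter of the offspring are
expected to be homozygous for the integration site, which is
the class the text proposes to establish as a line.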

Transgenic animals of the invention have uses which
include, but are not limited to, animal model systems
useful in elaborating the biological function of LSG
polypeptides of the present invention, studying conditions
and/or disorders associated with aberrant expression of
LSGs, and in screening for compounds effective in
ameliorating such LSG associated conditions and/or
disorders.

Knock-Out Animals

Endogenous gene expression can also be reduced by
inactivating or "knocking out" the gene and/or its promoter
using targeted homologous recombination (e.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas &
Capecchi, Cell 51: 503-512 (1987); Thompson et al., Cell 56:
313-321 (1989); each of which is incorporated by reference
herein in its entirety). For example, a mutant,
non-functional LSG polynucleotide of the invention (or a
completely unrelated DNA sequence) flanked by DNA
homologous to the endogenous LSG polynucleotide sequence
(either the coding regions or regulatory regions of the
gene) can be used, with or without a selectable marker
and/or a negative selectable marker, to transfect cells
that express polypeptides of the invention in vivo. In
another embodiment, techniques known in the art are used to
generate knockouts in cells that contain, but do not
express the gene of interest. Insertion of the DNA
construct, via targeted homologous recombination, results
in inactivation of the targeted gene. Such approaches are
particularly suited in research and agricultural fields
where modifications to embryonic stem cells can be used to
generate animal offspring with an inactive targeted gene
(e.g., see Thomas & Capecchi 1987 and Thompson 1989,
supra). This approach can also be routinely adapted for
use in humans provided the recombinant DNA constructs are
directly administered or targeted to the required site in
vivo using appropriate viral vectors that will be apparent
to those of skill in the art.

In further embodiments of the invention, cells that
are genetically engineered to express the LSG polypeptides
of the invention, or alternatively, that are genetically
engineered not to express the LSG polypeptides of the
invention (e.g., knockouts) are administered to a patient
in vivo. Such cells may be obtained from the patient or a
MHC compatible donor and can include, but are not limited
to, fibroblasts, bone marrow cells, blood cells (e.g.,
lymphocytes), adipocytes, muscle cells, and endothelial
cells. The cells are genetically engineered in vitro using
recombinant DNA techniques to introduce the coding sequence
of polypeptides of the invention into the cells, or
alternatively, to disrupt the coding sequence and/or
endogenous regulatory sequence associated with the
polypeptides of the invention, e.g., by transduction
(using viral vectors, and preferably vectors that integrate
the transgene into the cell genome) or transfection
procedures, including, but not limited to, the use of
plasmids, cosmids, YACs, naked DNA, electroporation,
liposomes, etc.

The coding sequence of the LSG polypeptides of the
invention can be placed under the control of a strong
constitutive or inducible promoter or promoter/enhancer to
achieve expression, and preferably secretion, of the LSG
polypeptides of the invention. The engineered cells which
express and preferably secrete the LSG polypeptides of the
invention can be introduced into the patient systemically,
e.g., in the circulation, or intraperitoneally.

Alternatively, the cells can be incorporated into a
matrix and implanted in the body, e.g., genetically
engineered fibroblasts can be implanted as part of a skin
graft or genetically engineered endothelial cells can be
implanted as part of a lymphatic or vascular graft (see,
for example, U.S. Patent 5,399,349 and U.S. Patent
5,460,959, each of which is incorporated by reference herein
in its entirety).

When the cells to be administered are non-autologous
or non-MHC compatible cells, they can be administered using
well known techniques which prevent the development of a
host immune response against the introduced cells. For
example, the cells may be introduced in an encapsulated
form which, while allowing for an exchange of components
with the immediate extracellular environment, does not
allow the introduced cells to be recognized by the host
immune system.

Transgenic and "knock-out" animals of the invention
have uses which include, but are not limited to, animal
model systems useful in elaborating the biological function
of LSG polypeptides of the present invention, studying
conditions and/or disorders associated with aberrant LSG
expression, and in screening for compounds effective in
ameliorating such LSG associated conditions and/or
disorders.

The following nonlimiting example is provided to
further illustrate the present invention.

The following Example is carried out using standard
techniques, which are well known and routine to those of
skill in the art, except where otherwise described in
detail. Routine molecular biology techniques of the
following example can be carried out as described in
standard laboratory manuals, such as Sambrook et al.,
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.; Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
(1989).

Introduction and background for Microarray analysis 

cDNA microarrays are prepared by high-speed robotic
printing of thousands of distinct cDNAs in an ordered array
on glass microscope slides. They are used to measure the
relative abundance of specific sequences in two complex
samples (Schena et al., 1995; Shalon et al., 1996).

In the microarray procedure, mRNA is isolated from
tissues of interest, either from a tumor or control (normal
or normal adjacent tissue). mRNA (200-600 ng) from cancer
tissue or control is reverse transcribed to incorporate the
fluorescent nucleotides Cy5 (red) or Cy3 (green),
respectively. The two populations of fluorescently labeled
cDNA are mixed together and hybridized simultaneously to a
microarray bearing approximately 10,000 cDNA elements in a
2 cm x 2 cm area on a glass slide (microarray hybridization
service: Incyte Genomics, Fremont, CA, USA). After
hybridization, the slides are scanned with a scanning laser
confocal microscope.

The scanned image is used to generate the intensity
and local background measurements for each spot on the
array (GEMtools software, Incyte Genomics). For each spot,
representing one EST, the ratio of the normalized Cy5/Cy3
intensities generates a quantitation of the gene's
expression in one tissue relative to the control, in this
case, the expression in cancer tissue versus either normal
or normal adjacent tissue. For example, a gene that shows
a Cancer-Cy5 intensity of 3000 and a Normal-Cy3 intensity
of 1000 is expressed 3-fold more in cancer tissue.
Advanced analysis software is used to sort and decipher
patterns of gene expression from the data (Cluster and
Treeview programs, Stanford University; Eisen et al., 1998;
Alizadeh et al., 2000). However, the reproducibility study
from Incyte shows that the level of detectable differential
expression is calculated to be approximately plus or minus
1.74. Consequently, any elements with observed ratios
greater than or equal to 1.8 between cancer and normal are
deemed differentially expressed.
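
The ratio calculation and the 1.8 cutoff described above
can be sketched as follows. This is an illustrative sketch,
not the GEMtools implementation: the function names are
assumptions, and the intensities are taken to be already
normalized and background-corrected.

```python
# Cutoff stated in the text: ratios >= 1.8 between cancer and normal
# are deemed differentially expressed.
DIFFERENTIAL_THRESHOLD = 1.8


def expression_ratio(cy5_cancer: float, cy3_normal: float) -> float:
    """Ratio of normalized Cy5 (cancer) to Cy3 (normal) intensity for one spot."""
    if cy3_normal <= 0:
        raise ValueError("normal (Cy3) intensity must be positive")
    return cy5_cancer / cy3_normal


def is_differentially_expressed(cy5_cancer, cy3_normal,
                                threshold=DIFFERENTIAL_THRESHOLD):
    """Apply the cancer-versus-normal ratio cutoff to one array element."""
    return expression_ratio(cy5_cancer, cy3_normal) >= threshold


# Worked example from the text: Cy5 = 3000 and Cy3 = 1000 is a 3-fold ratio.
print(expression_ratio(3000, 1000), is_differentially_expressed(3000, 1000))
```

A spot with a Cy5 intensity of 1700 against a Cy3 intensity
of 1000 would fall below the cutoff and would not be
reported as differentially expressed.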
References:

1. Schena, M., D. Shalon, R.W. Davis, and P.O. Brown.
1995. Quantitative monitoring of gene expression patterns
with a complementary DNA microarray. Science 270:
467-470.

2. Shalon, D., S.J. Smith, and P.O. Brown. 1996. A DNA
microarray system for analyzing complex DNA samples using
two-color fluorescent probe hybridization. Genome Research
6: 639-645.

3. Eisen, M.B., P.T. Spellman, P.O. Brown, and D. Botstein.
1998. Cluster analysis and display of genome-wide
expression patterns. PNAS 95: 14863-14868.

4. Alizadeh, A.A., et al. 2000. Distinct types of diffuse
large B-cell lymphoma identified by gene expression
profiling. Nature 403: 503-511.

5. GEM Microarray Reproducibility Study. Technical
specifications from Incyte Genomics.

Lung diaDexus microarray candidates 

Following is a list of "diaDexus microarray
candidates" sequences for lung cancer, also referred to
herein as lung specific genes or LSGs:

SEQ ID    Gene ID    ddx lung code    ddx qPCR lung
1/19      979057     Lng128           Lng128
2/20      347842     Lng129           Lng129
3         983439     Lng112           Lng112
4         236582     Lng114           Lng114
5         210995     Lng118           Lng118
6         208994     Lng121           Lng121
7         1066498    Lng124           Lng124
8         287016     Lng126           Lng126
9         10717      SQLng001         Lng136
10        24945      SQLng00S         Lng143
11        52017      SQLng007         Lng144
12        460254     SQLngllO         Lng138
13/74     179090     SQLng012         Lng137
14        6348       SQLng004         Lng142
15        94694      SQLng00S         Lng140
16        145812     SQLng008         Lng1S1
17        10713      SQLng002         Lng150
18        20152      SQLng003         Lng141
Example 1
Sequence 1
Lng128
Gene ID 979057

Table 1. The absolute numbers are relative levels of
expression of Lng128 in 24 different normal tissues. All
the values are compared to normal trachea (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue             NORMAL
Adrenal Gland      0.03
Bladder            0.00
Brain              6.68
Cervix             0.00
Colon              0.00
Endometrium        0.12
Esophagus          0.00
Heart              0.01
Kidney             0.02
Liver              0.03
Lung               35.63
Mammary Gland      0.02
Muscle             0.00
Ovary              1.11
Pancreas           17.94
Prostate           0.42
Rectum             0.16
Small Intestine    0.00
Spleen             1.27
Stomach            0.00
Testis             2.17
Thymus             0.13
Trachea            1.00
Uterus             0.09

0 = negative





The relative levels of expression in Table 1 show
that Lng128 mRNA expression is much higher in lung (35.63)
compared with most other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
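
The tissue comparison drawn from Table 1 can be reproduced
by ranking the pooled values. The sketch below transcribes
the Table 1 values for Lng128 (relative to normal trachea =
1.00); the variable names are illustrative assumptions and
the ranking is only a convenience for reading the table, not
part of the described assay.

```python
# Relative expression of Lng128 in pooled normal tissues, transcribed from
# Table 1 (calibrator: normal trachea = 1.00).
TABLE_1_LNG128 = {
    "Adrenal Gland": 0.03, "Bladder": 0.00, "Brain": 6.68, "Cervix": 0.00,
    "Colon": 0.00, "Endometrium": 0.12, "Esophagus": 0.00, "Heart": 0.01,
    "Kidney": 0.02, "Liver": 0.03, "Lung": 35.63, "Mammary Gland": 0.02,
    "Muscle": 0.00, "Ovary": 1.11, "Pancreas": 17.94, "Prostate": 0.42,
    "Rectum": 0.16, "Small Intestine": 0.00, "Spleen": 1.27, "Stomach": 0.00,
    "Testis": 2.17, "Thymus": 0.13, "Trachea": 1.00, "Uterus": 0.09,
}

# Sort tissues from highest to lowest relative expression.
ranked = sorted(TABLE_1_LNG128.items(), key=lambda item: item[1], reverse=True)

for tissue, value in ranked[:5]:
    print(f"{tissue:<16}{value:>7.2f}")
```

Lung ranks first, followed by pancreas and brain, which is
consistent with the statement that expression is higher in
lung than in most, though not all, of the other normal
tissues analyzed.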

Table 2. The absolute numbers are relative levels of
expression of Lng128 in 69 pairs of matching samples, plus
one normal ovary sample and one ovary cancer sample. All the
values are compared to normal trachea (calibrator). A
matching pair is formed by mRNA from the cancer sample for a
particular tissue and mRNA from the normal adjacent sample
for that same tissue from the same individual.





Sample ID    Cancer Type                        Tissue             CANCER    MATCHING NORMAL ADJACENT
Lng 60L      Adenocarcinoma                     Lung 1             196.04    17.03
Lng 143L     Adenocarcinoma                     Lung 2             21.93     0.88
Lng 60XL     Adenocarcinoma                     Lung 3             17.21     42.37
Lng AC82     Adenocarcinoma                     Lung 4             43.26     4.56
Lng AC88     Adenocarcinoma                     Lung 5             364.56    101.48
Lng AC66     Adenocarcinoma                     Lung 6             17.94     13.27
Lng AC69     Adenocarcinoma                     Lung 7             582.05    39.12
Lng AC11     Adenocarcinoma                     Lung 8             24.42     113.38
Lng AC32     Adenocarcinoma                     Lung 9             648.07    27.19
Lng AC39     Adenocarcinoma                     Lung 10            249.00    1.71
Lng AC94     Adenocarcinoma                     Lung 11            42.81     112.99
Lng AC90     Adenocarcinoma                     Lung 12            196.72    0.48
Lng 47XQ     Adenocarcinoma                     Lung 13            88.95     0.54
Lng 223L     Adenocarcinoma                     Lung 14            5.80      0.00
Lng 528L     Adenocarcinoma                     Lung 15            45.25     177.91
Lng BR26     Bronchio-alveolar carcinoma        Lung 16            2.80      28.54
Lng BA641    Bronchogenic carcinoma             Lung 17            1746.20   36.13
Lng 315L     Squamous cell carcinoma            Lung 18            1.67      736.73
Lng SQ45     Squamous cell carcinoma            Lung 19            828.87    62.68
Lng SQ14     Squamous cell carcinoma            Lung 20            0.07      15.56
Lng SQ9X     Squamous cell carcinoma            Lung 21            73.26     4.32
Lng SQ56     Squamous cell carcinoma            Lung 22            33.24     141.53
Lng SQ80     Squamous cell carcinoma            Lung 23            101.13    44.79
Lng SQ32     Squamous cell carcinoma            Lung 24            119.43    9.82
Lng SQ16     Squamous cell carcinoma            Lung 25            64.00     10.85
Lng SQ79     Squamous cell carcinoma            Lung 26            52.16     142.52
Lng 90X      Squamous cell carcinoma            Lung 27            38.72     6.23
Lng BR94     Squamous cell carcinoma            Lung 28            27.19     0.00
Lng C20X     Squamous cell carcinoma            Lung 29            0.00      1.59
Lng SQ44     Squamous cell carcinoma            Lung 30            13.88     0.04
Lng SQ43     Squamous cell carcinoma            Lung 31            24.00     1.39
Lng 77L      Large cell carcinoma               Lung 32            0.15      13.93
Lng LC71     Large cell carcinoma               Lung 33            61.61     190.68
Lng LC109    Large cell carcinoma               Lung 34            25.19     513.78
Lng LC80     Large cell carcinoma               Lung 35            537.45    47.01
Lng 75XC     Metastatic from bone cancer        Lung 36            44.79     39.95
Lng MT71     Metastatic from renal cell cancer  Lung 37            11.35     26.45
Lng MT67     Metastatic from melanoma           Lung 38            3.28      7.97
Bld 46XK                                        Bladder 1          0.00      0.00
Bld TR14                                        Bladder 2          0.46      0.00
Cvx KS52                                        Cervix 1           0.29      0.00
Cvx KS83                                        Cervix 2           0.00      0.00
Cln AS45                                        Colon 1            0.00      0.00
Cln RC01                                        Colon 2            0.00      0.10
End 8911                                        Endometrium 1      0.08      0.68
End 28XA                                        Endometrium 2      12.73     0.57
Kid 107XD                                       Kidney 1           0.02      0.02
Kid 109XD                                       Kidney 2           0.05      0.25
Liv 94XA                                        Liver 1            0.00      0.00
Liv 174L                                        Liver 2            0.00      0.00
Mam 162X                                        Mammary 1          0.00      0.02
Mam 497M                                        Mammary 2          0.00      0.00
Ovr A082                                        Ovary 1            0.03      1.57
Ovr 18GA                                        Ovary 2                      5.78
Ovr 180B                                        Ovary 3            0.03
Pan 71X                                         Pancreas 1         0.03      0.02
Pan 92X                                         Pancreas 2         0.65      0.00
Pro 109XB                                       Prostate 1         0.01      0.03
Pro 125XB                                       Prostate 2         0.02      0.02
Skn 248S                                        Skin 1             0.11      0.00
Skn 816S                                        Skin 2             1.01      0.00
SmInt 21XA                                      Small Intestine 1  0.01      0.00
SmInt H89                                       Small Intestine 2  0.67      2.76
Sto 758S                                        Stomach 1          0.00      0.00
Sto 531S                                        Stomach 2          0.08      0.00
Tst 647T                                        Testis 1           4.38      0.96
Tst 39X                                         Testis 2           8.69      1.19
Thr 143N                                        Thyroid 1          0.15      0.00
Thr 270T                                        Thyroid 2          0.00      0.00
Utr 135X0                                       Uterus 1           0.19      0.27
Utr 141X0                                       Uterus 2           0.06      0.00

0 = negative



In the analysis of matching samples, higher expression
of Lng128 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 2 shows overexpression of
Lng128 in 21 lung cancer tissues compared with their
respective normal adjacent tissues (lung samples #1, 2, 4,
5, 7, 9, 10, 12, 13, 14, 17, 19, 21, 23, 24, 25, 27, 28, 30,
31, and 35). There is overexpression in the cancer tissue
for 55% of the lung matching samples tested (21 out of a
total of 38 lung matching samples).

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, are believed to make Lng128 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.
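
The tally of overexpressing lung samples can be checked
against the Table 2 values with a short script. The sketch
below transcribes the 38 lung pairs for Lng128 (cancer,
matching normal adjacent). The specification does not state
the criterion used to call a sample overexpressed; the
2-fold cutoff used here is an assumption chosen for
illustration, and the names are hypothetical.

```python
# (cancer, matching normal adjacent) for lung samples 1-38, from Table 2.
LUNG_PAIRS = [
    (196.04, 17.03), (21.93, 0.88), (17.21, 42.37), (43.26, 4.56),
    (364.56, 101.48), (17.94, 13.27), (582.05, 39.12), (24.42, 113.38),
    (648.07, 27.19), (249.00, 1.71), (42.81, 112.99), (196.72, 0.48),
    (88.95, 0.54), (5.80, 0.00), (45.25, 177.91), (2.80, 28.54),
    (1746.20, 36.13), (1.67, 736.73), (828.87, 62.68), (0.07, 15.56),
    (73.26, 4.32), (33.24, 141.53), (101.13, 44.79), (119.43, 9.82),
    (64.00, 10.85), (52.16, 142.52), (38.72, 6.23), (27.19, 0.00),
    (0.00, 1.59), (13.88, 0.04), (24.00, 1.39), (0.15, 13.93),
    (61.61, 190.68), (25.19, 513.78), (537.45, 47.01), (44.79, 39.95),
    (11.35, 26.45), (3.28, 7.97),
]

FOLD_CUTOFF = 2.0  # assumed criterion, not stated in the specification


def overexpressed_samples(pairs, fold=FOLD_CUTOFF):
    """Return 1-based sample numbers where cancer >= fold x normal adjacent."""
    return [
        number
        for number, (cancer, normal) in enumerate(pairs, start=1)
        # Comparing by multiplication avoids dividing by a zero normal value.
        if cancer > 0 and cancer >= fold * normal
    ]


hits = overexpressed_samples(LUNG_PAIRS)
percent = 100 * len(hits) / len(LUNG_PAIRS)
print(hits)
print(f"{len(hits)} of {len(LUNG_PAIRS)} lung pairs ({percent:.0f}%)")
```

With this assumed cutoff the script selects the same 21
lung samples that are listed in the text, i.e. 55% of the 38
lung pairs; other cutoffs would give a different count.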

Northern Analysis

Two transcripts: ~2.2 kb and ~4.2 kb

Primers Used for QPCR Expression Analysis

Forward primer
CTTGGTCTTCCTGCTCCTGAC (SEQ ID NO: 21)
Reverse primer
AGGGCAGAGAGGAACAGCA (SEQ ID NO: 22)
Probe
CCAGCGAGGAGCAGCAGGGATG (SEQ ID NO: 23)
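
Basic properties of the oligonucleotides listed above, such
as length and GC content, can be computed directly from the
sequences. The following is a small sketch for orientation
only; the dictionary keys and the helper name are
illustrative assumptions, and no melting temperature or
specificity claim is made.

```python
# QPCR oligonucleotides for Lng128 as listed in the text.
OLIGOS = {
    "forward primer (SEQ ID NO: 21)": "CTTGGTCTTCCTGCTCCTGAC",
    "reverse primer (SEQ ID NO: 22)": "AGGGCAGAGAGGAACAGCA",
    "probe (SEQ ID NO: 23)": "CCAGCGAGGAGCAGCAGGGATG",
}


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    sequence = sequence.upper()
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


for name, seq in OLIGOS.items():
    print(f"{name}: {len(seq)} nt, GC = {gc_fraction(seq):.0%}")
```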






Example 2
Sequence 2
Lng129
Gene ID 347842

Table 1. The absolute numbers are relative levels of
expression of Lng129 in 24 different normal tissues. All
the values are compared to normal spleen (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue             NORMAL
Adrenal Gland      0.00
Bladder            0.00
Brain              0.00
Cervix             0.02
Colon              0.00
Endometrium        0.03
Esophagus          0.00
Heart              0.00
Kidney             0.00
Liver              0.01
Lung               0.12
Mammary Gland      0.00
Muscle             0.00
Ovary              0.04
Pancreas           0.00
Prostate           0.01
Rectum             0.00
Small Intestine    0.00
Spleen             1.00
Stomach            0.00
Testis             0.01
Thymus             0.03
Trachea            0.06
Uterus             0.06

0 = negative



The relative levels of expression in Table 1 show that
Lng129 mRNA expression is high compared with most other
normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng129 in 67 pairs of matching samples, plus
one normal ovary sample and one ovary cancer sample. All the
values are compared to normal spleen (calibrator). A
matching pair is formed by mRNA from the cancer sample for a
particular tissue and mRNA from the normal adjacent sample
for that same tissue from the same individual.



30 


Sample ID    Cancer Type                        Tissue             CANCER    MATCHING NORMAL
Lng 60L      Adenocarcinoma                     Lung 1             0.71      0.69
Lng 143L     Adenocarcinoma                     Lung 2             0.01      0.00
Lng 60XL     Adenocarcinoma                     Lung 3             0.00      0.01
Lng AC82     Adenocarcinoma                     Lung 4             0.31      0.00
Lng AC88     Adenocarcinoma                     Lung 5             0.00      0.00
Lng AC66     Adenocarcinoma                     Lung 6             0.80      0.07
Lng AC69     Adenocarcinoma                     Lung 7             0.00      0.00
Lng AC11     Adenocarcinoma                     Lung 8             0.48      0.06
Lng AC32     Adenocarcinoma                     Lung 9             0.39      0.00
Lng AC39     Adenocarcinoma                     Lung 10            0.53      0.01
Lng AC94     Adenocarcinoma                     Lung 11            0.05      0.03
Lng AC90     Adenocarcinoma                     Lung 12            0.04      0.00
Lng 47XQ     Adenocarcinoma                     Lung 13            0.12      0.00
Lng 223L     Adenocarcinoma                     Lung 14            0.04      0.00
Lng 528L     Adenocarcinoma                     Lung 15            0.00      0.00
Lng BR26     Bronchio-alveolar                  Lung 16            0.24      0.94
Lng BA641    Bronchogenic carcinoma             Lung 17            0.40      0.10
Lng 315L     Squamous cell                      Lung 18            0.03      0.12
Lng SQ45     Squamous cell                      Lung 19            0.00      0.00
Lng SQ14     Squamous cell                      Lung 20            0.02      0.11
Lng SQ9X     Squamous cell                      Lung 21            0.00      0.00
Lng SQ56     Squamous cell                      Lung 22            0.43      0.12
Lng SQ80     Squamous cell                      Lung 23            0.00      0.00
Lng SQ32     Squamous cell                      Lung 24            0.06      0.00
Lng SQ16     Squamous cell                      Lung 25            0.01      0.00
Lng SQ79     Squamous cell                      Lung 26            0.11      0.04
Lng 90X      Squamous cell                      Lung 27            0.00      0.00
Lng BR94     Squamous cell                      Lung 28            4.76      0.00
Lng C20X     Squamous cell                      Lung 29            0.00      0.00
Lng SQ44     Squamous cell                      Lung 30            0.04      0.00
Lng SQ43     Squamous cell                      Lung 31            0.82      0.08
Lng 77L      Large cell carcinoma               Lung 32            0.00      0.00
Lng LC71     Large cell carcinoma               Lung 33            0.05      0.30
Lng LC109    Large cell carcinoma               Lung 34            1.48      0.90
Lng LC80     Large cell carcinoma               Lung 35            1.09      0.00
Lng 75XC     Metastatic from bone               Lung 36            0.00      0.00
Lng MT71     Metastatic from renal              Lung 37            0.18      0.04
Lng MT67     Metastatic from                    Lung 38            0.55      0.04
Bld 46XK                                        Bladder 1          0.02      0.00
Bld TR14                                        Bladder 2          0.46      0.39
Cvx KS52                                        Cervix 1           0.26      0.03
Cvx KS83                                        Cervix 2           0.00      0.00
Cln AS45                                        Colon 1            0.00      0.00
Cln RC01                                        Colon 2            0.01      0.02
End                                             Endometrium        0.00      0.00
Kid 107XD                                       Kidney 1           1.53      0.03
Kid 109XD                                       Kidney 2           0.33      0.11
Liv 175L                                        Liver 1            0.27      0.03
Liv 174L                                        Liver 2            0.01      0.01
Mam                                             Mammary 1          0.02      0.01
Mam 497M                                        Mammary 2          0.00      0.00
Ovr A082                                        Ovary 1            0.01      0.00
Ovr 18GA                                        Ovary 2                      0.01
Ovr 180B                                        Ovary 3            0.00
Pan 77X                                         Pancreas 1         0.00      0.00
Pan                                             Pancreas 2         0.00      0.00
Pro                                             Prostate 1         0.01      0.02
Pro                                             Prostate 2         0.00      0.00
Skn 248S                                        Skin 1             0.13      0.02
SmInt                                           Small Intestine    0.02      0.01
SmInt H89                                       Small Intestine    0.00      0.00
Sto                                             Stomach 1          0.15      0.01




Sto 


531S 




Stomach 2 


0.00 


0. 


00 




Tst647T 




Testis 1 


0.00 


0 


00 




Tat 


39X 




Testis 2 


0.30 


0. 


02 


50 


Thr 






Thyroid 1 


0.04 


0 


03 




Thr 


270T 




Thyroid 2 


0.11 


0. 


00 




Utrl35XO 




Uterus 1 


0.20 


0. 


00 
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|Utr Uterus 2 0.00 O.Oll 

1 14IXO _ | 

0= Negative 

In the analysis of matching samples, higher expression
of Lng129 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 2 shows overexpression of
Lng129 in 23 lung cancer tissues compared with their
respective normal adjacent (lung samples #2, 4, 6, 8, 9,
10, 11, 12, 13, 14, 17, 22, 24, 25, 26, 28, 30, 31, 33, 34,
35, 37, and 38). There is overexpression in the cancer
tissue for 61% of the lung matching samples tested (23 out
of a total of 38 lung matching samples).
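
The percentage quoted above follows from the list of lung sample numbers. The short Python sketch below only re-checks that arithmetic; the variable and function names are illustrative and are not part of the original analysis.

```python
# Lung matching pairs reported above as overexpressing the gene in the cancer sample.
overexpressing_pairs = [2, 4, 6, 8, 9, 10, 11, 12, 13, 14, 17, 22,
                        24, 25, 26, 28, 30, 31, 33, 34, 35, 37, 38]
total_lung_pairs = 38  # total number of lung matching pairs tested


def percent_overexpressing(listed, total):
    """Return the share of matching pairs showing overexpression, in percent."""
    return 100.0 * len(listed) / total


percent = percent_overexpressing(overexpressing_pairs, total_lung_pairs)
print(f"{len(overexpressing_pairs)} of {total_lung_pairs} pairs = {percent:.1f}%")
```

Rounded to a whole number, the result agrees with the 61% stated in the text.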

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested are believed to make Lng129 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

Northern Analysis 

Two transcripts - 6.5 kb and - 9 kb 

DNA sequence for Lngl29 

Sequence available from Incyte database. 

Primers Used for QPCR Expression Analysis

Forward primer
GCCTGTTTGGGAGATTAGATTTT (SEQ ID NO: 24)
Reverse primer
GCCCAAACAGAACAGACTAAAAA (SEQ ID NO: 25)
Probe
AGGTTATTAGGTTATTATCTCTCTCTCCTGATTTTTCC (SEQ ID NO: 26)
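
Basic properties of the oligonucleotides listed above (length, GC content, reverse complement) can be tabulated with a few lines of Python. This is a minimal sketch for sanity-checking the sequences as transcribed; the helper names `gc_fraction` and `reverse_complement` are hypothetical and no primer-design criteria from the original work are implied.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# Sequences copied from the primer list above.
oligos = {
    "forward (SEQ ID NO: 24)": "GCCTGTTTGGGAGATTAGATTTT",
    "reverse (SEQ ID NO: 25)": "GCCCAAACAGAACAGACTAAAAA",
    "probe (SEQ ID NO: 26)": "AGGTTATTAGGTTATTATCTCTCTCTCCTGATTTTTCC",
}

for name, seq in oligos.items():
    print(f"{name}: length={len(seq)}, GC={gc_fraction(seq):.2f}, "
          f"revcomp={reverse_complement(seq)}")
```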

Example 3
Sequence 3
Lng112

Gene ID 983439

Table 1. The absolute numbers are relative levels of
expression of Lng112 in 12 different normal tissues. These
RNA samples are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals.





Tissue            NORMAL
Brain             0
Heart             0
Kidney            0
Liver             0
Lung              1.0
Mammary           0
Muscle            0
Prostate          0
SmInt             0
Testis            0
Thymus            0
Uterus            0

0 = negative



The relative levels of expression in Table 1 show that
Lng112 mRNA expression is only detectable in lung compared
with other normal tissues analyzed.

The absolute numbers in Table 1 were obtained
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
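
This passage does not state how a value "relative to the calibrator" is computed. As a minimal sketch, assuming the comparative Ct (2^-ddCt) calculation that is commonly used with quantitative PCR, the function below normalizes the target gene to an endogenous reference gene and then to the calibrator sample. The function name, argument names and Ct values are hypothetical and are not taken from the experiments described here.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Expression of a sample relative to the calibrator (calibrator = 1.0).

    Assumes the comparative Ct method: each PCR cycle doubles the product,
    so a difference of one cycle corresponds to a two-fold difference.
    """
    delta_ct_sample = ct_target - ct_reference              # normalize sample
    delta_ct_calibrator = ct_target_cal - ct_reference_cal  # normalize calibrator
    delta_delta_ct = delta_ct_sample - delta_ct_calibrator
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: the calibrator measured against itself gives 1.0.
calibrator = relative_expression(30.0, 20.0, 30.0, 20.0)
# A sample whose target crosses threshold three cycles earlier is 8-fold higher.
sample = relative_expression(27.0, 20.0, 30.0, 20.0)
print(calibrator, sample)
```

Under this assumption every reported number is a ratio to a calibrator measured in the same experiment, which is one reason values obtained from pooled samples and from single individuals are not on a common scale.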

Table 2. The absolute numbers are relative levels of
expression of Lng112 in 49 pairs of matching samples. All
the values are compared to normal lung (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.





Sample ID        Cancer Type                        Tissue         CANCER       MATCHING NORMAL ADJACENT
Lng 143L         Adenocarcinoma                     Lung 1         0.00         0.00
Lng 60L          Adenocarcinoma                     Lung 2         13.00        9.00
Lng AC82         Adenocarcinoma                     Lung 3         0.00         5.00
Lng 60XL         Adenocarcinoma                     Lung 4         0.00         19.00
Lng AC66         Adenocarcinoma                     Lung 5         0.00         59.00
Lng AC69         Adenocarcinoma                     Lung 6         9.00         0.00
Lng AC88         Adenocarcinoma                     Lung 7         16.00        35.00
Lng AC11         Adenocarcinoma                     Lung 8         0.00         36.00
Lng AC32         Adenocarcinoma                     Lung 9         0.00         41.00
Lng AC39         Adenocarcinoma                     Lung 10        0.00         5.00
Lng AC94         Adenocarcinoma                     Lung 11        0.00         0.00
Lng BA641        Bronchio-alveolar carcinoma        Lung 12        7.00         14.00
Lng SQ32         Squamous cell carcinoma            Lung 13        0.00         228.00
Lng SQ45         Squamous cell carcinoma            Lung 14        368.00       2.00
Lng SQ56         Squamous cell carcinoma            Lung 15        1.00         53.00
Lng SQ9X         Squamous cell carcinoma            Lung 16        0.00         2.00
Lng SQ14         Squamous cell carcinoma            Lung 17        0.00         21.00
Lng SQ16         Squamous cell carcinoma            Lung 18        [illegible]  1.00
Lng [illegible]  Squamous cell carcinoma            Lung 19        6.00         7.00
Lng [illegible]  Squamous cell carcinoma            Lung 20        [illegible]  [illegible]
Lng [illegible]  Squamous cell carcinoma            Lung 21        1.00         3.00
Lng [illegible]  Squamous cell carcinoma            Lung 22        0.00         0.00
Lng SQ79         Squamous cell carcinoma            Lung 23        [illegible]  [illegible]
Lng 90X          Squamous cell carcinoma            Lung 24        0.00         4.00
Lng BR94         Squamous cell carcinoma            Lung 25        0.00         0.00
Lng LC71         Large cell carcinoma               Lung 26        178.00       4.00
Lng LC80         Large cell carcinoma               Lung 27        0.00         0.00
Lng LC109        Large cell carcinoma               Lung 28        1.00         96.00
Lng 77L          Large cell carcinoma               Lung 29        0.00         0.00
Lng 75XC         Metastatic from bone cancer        Lung 30        0.00         22.00
Lng MT67         Metastatic from renal cell cancer  Lung 31        1.00         86.00
Lng MT71         Metastatic from melanoma           Lung 32        0.00         14.00
Bld 32XK                                            Bladder 1      0.00         0.00
Cln AS45                                            Colon 1        0.00         0.00
Cvx KS52                                            Cervix 1       0.00         0.00
End 28XA                                            Endometrium 1  0.00         0.00
Kid 106XD                                           Kidney 1       0.00         0.00
Liv 94XA                                            Liver 1        0.00         0.00
Mam A06X                                            Mammary 1      0.00         0.00
Ovr 103X                                            Ovary 1        0.00         0.00
Pan 71XL                                            Pancreas 1     0.00         0.00
Pan 77X                                             Pancreas 2     0.00         0.00
Pro 20XB                                            Prostate 1     0.00         0.00
Skn 287S                                            Skin 1         0.00         0.00
SmInt H89                                           Sm. Int. 1     0.00         0.00
Sto 531S                                            Stomach 1      0.00         0.00
Thr 143N                                            Thyroid 1      0.00         0.00
Tst 39X                                             Testis 1       0.00         0.00
Utr 135XO                                           Uterus 1       16.00        0.00

0 = Negative










In the analysis of matching samples, except for 1 uterus
cancer sample, the only detection was in lung samples,
showing a high degree of tissue specificity for lung
tissue. These results confirm the tissue specificity
results obtained with normal pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 2 shows overexpression of
Lng112 in 4 lung cancer tissues compared with their
respective normal adjacent tissue in 32 cancer matching
pairs (lung samples #2, 6, 14, and 26). There is
overexpression in the cancer tissue for 12.5% of the lung
matching samples tested (total of 32 lung matching
samples).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. The result shows that Lng112 is
expressed differentially in all 32 lung cancer tissues
tested compared with their respective normal adjacent.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested are believed to make Lng112 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

Primers Used for QPCR Expression Analysis

Forward primer
TGGTGGCGTTCCTCCTGTC (SEQ ID NO: 27)
Reverse primer
CAGAGCCCTTCGTACTGGAACAC (SEQ ID NO: 28)
Probe
TCGTACAGGTCCTGGGTGCTCCACA (SEQ ID NO: 29)

Example 4
Sequence 4
Lng114

Gene ID 236582

Table 1. The absolute numbers are relative levels of
expression of Lng114 in 12 different normal tissues. All
the values are compared to normal testis (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue            NORMAL
Brain             0.09
Heart             0.00
Kidney            0.26
Liver             0.00
Lung              602.58
Mammary           0.35
Muscle            0.00
Prostate          0.00
SmInt             0.05
Testis            1.00
Thymus            0.00
Uterus            1.27

0 = negative

The relative levels of expression in Table 1 show that
Lng114 mRNA expression is highest in lung (602.58) compared
with other normal tissues analyzed.
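
The ranking stated above can be reproduced directly from the Table 1 values. The sketch below is illustrative only: it sorts the twelve pooled-tissue values and reports how far lung stands above the next highest tissue. The dictionary name and the fold-ratio summary are additions for illustration, not part of the original analysis.

```python
# Relative expression values from Table 1 (normal testis = calibrator = 1.00).
table1 = {
    "Brain": 0.09, "Heart": 0.00, "Kidney": 0.26, "Liver": 0.00,
    "Lung": 602.58, "Mammary": 0.35, "Muscle": 0.00, "Prostate": 0.00,
    "SmInt": 0.05, "Testis": 1.00, "Thymus": 0.00, "Uterus": 1.27,
}

# Sort tissues from highest to lowest relative expression.
ranked = sorted(table1.items(), key=lambda item: item[1], reverse=True)
top_tissue, top_value = ranked[0]
runner_up, runner_up_value = ranked[1]

# Ratio between the highest and the second-highest tissue.
fold_over_runner_up = top_value / runner_up_value

print(f"highest: {top_tissue} ({top_value})")
print(f"next: {runner_up} ({runner_up_value})")
print(f"ratio: {fold_over_runner_up:.0f}-fold")
```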

The absolute numbers in Table 1 were obtained
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng114 in 78 pairs of matching samples, 1
normal ovary and 2 blood samples. All the values are
compared to normal testis (calibrator). A matching pair is
formed by mRNA from the cancer sample for a particular
tissue and mRNA from the normal adjacent sample for that
same tissue from the same individual.



Sample ID   Cancer Type                        Tissue         CANCER    MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma                     Lung 1         121.52    66.72
Lng 143L    Adenocarcinoma                     Lung 2         360.79    25.99
Lng 60XL    Adenocarcinoma                     Lung 3         79.81     648.73
Lng AC82    Adenocarcinoma                     Lung 4         37.53     102.18
Lng AC88    Adenocarcinoma                     Lung 5         530.06    992.55
Lng AC66    Adenocarcinoma                     Lung 6         76.68     257.93
Lng AC69    Adenocarcinoma                     Lung 7         25.46     8.51
Lng AC11    Adenocarcinoma                     Lung 8         54.19     852.17
Lng AC32    Adenocarcinoma                     Lung 9         157.59    193.34
Lng AC94    Adenocarcinoma                     Lung 10        2272.40   112.99
Lng AC90    Adenocarcinoma                     Lung 11        141.53    38.85
Lng AC39    Adenocarcinoma                     Lung 12        198.09    8.40
Lng 223L    Adenocarcinoma                     Lung 13        10.63     31.89
Lng 528L    Adenocarcinoma                     Lung 14        210.84    274.37
Lng BR26    Bronchogenic carcinoma             Lung 15        0.00      169.48
Lng BA641   Bronchio-alveolar carcinoma        Lung 16        316.27    73.77
Lng 315L    Squamous cell carcinoma            Lung 17        0.00      469.51
Lng SQ14    Squamous cell carcinoma            Lung 18        1016.93   15.83
Lng SQ56    Squamous cell carcinoma            Lung 19        0.78      526.39
Lng SQ9X    Squamous cell carcinoma            Lung 20        52.89     64.89
Lng SQ80    Squamous cell carcinoma            Lung 21        60.34     962.07
Lng SQ45    Squamous cell carcinoma            Lung 22        97.01     357.05
Lng SQ16    Squamous cell carcinoma            Lung 23        92.41     1833.01
Lng SQ32    Squamous cell carcinoma            Lung 24        23.75     31.02
Lng SQ79    Squamous cell carcinoma            Lung 25        20.89     142.52
Lng 47XQ    Squamous cell carcinoma            Lung 26        42.52     135.77
Lng BR94    Squamous cell carcinoma            Lung 27        211.50    157.78
Lng 90X     Squamous cell carcinoma            Lung 28        80.73     12.21
Lng C20X    Squamous cell carcinoma            Lung 29        2.99      15.24
Lng SQ44    Squamous cell carcinoma            Lung 30        94.03     0.00
Lng SQ43    Squamous cell carcinoma            Lung 31        27.19     38.85
Lng LC71    Large cell carcinoma               Lung 32        1217.75   2040.91
Lng LC109   Large cell carcinoma               Lung 33        160.42    4576.44
Lng LC80    Large cell carcinoma               Lung 34        955.43    400.32
Lng 77L     Large cell carcinoma               Lung 35        18.44     78.52
Lng 75XC    Metastatic from bone cancer        Lung 36        229.13    398.93
Lng MT67    Metastatic from renal cell cancer  Lung 37        69.07     1514.89
Lng MT71    Metastatic from melanoma           Lung 38        42.37     1393.99
Bld 46XK                                       Bladder 1      0.00      0.00
Bld 66X                                        Bladder 2      0.00      0.00
Blo B5                                         Blood 1
Blo B6                                         Blood 2
Cln AS43                                       Colon 1        1.22      0.00
Cln AS45                                       Colon 2        0.00      0.00
Cln AS46                                       Colon 3        2.08      0.00
Cln SG67                                       Colon 4        1.49      1.39
Cvx KS52                                       Cervix 1       2.39      11.47
Cvx KS83                                       Cervix 2       1.22      4.55
Endo 28XA                                      Endometrium 1  108.38    2.86
Endo 68X                                       Endometrium 2  3.73      12.64
Kid 10XD                                       Kidney 1       39.40     0.00
Kid 109XD                                      Kidney 2       1.91      8.46
Kid 107XD                                      Kidney 3       1.48      4.61
Liv 15XA                                       Liver 1        0.03      0.07
Liv 201L                                       Liver 2        0.00      0.00
Liv 174L                                       Liver 3        0.00      0.00
Mam 162X                                       Mammary 1      0.78      0.28
Mam 173M                                       Mammary 2      1.00      0.00
Mam 220                                        Mammary 3      2.02      0.30
Ovr 18GA                                       Ovary 1                  0.00
Ovr A084                                       Ovary 2        0.00      0.00
Pro 101XB                                      Prostate 1     0.86      1.38
Pro 109XB                                      Prostate 2     0.00      0.23
Pro 125XB                                      Prostate 3     0.00      0.08
Pan 77X                                        Pancreas 1     0.00      0.00
Skn 39A                                        Skin 1         0.20      0.00
Skn 39AB                                       Skin 2         0.00      0.00
Skn 248S                                       Skin 3         0.00      0.00
SmInt 21XA                                     Sm. Int. 1     0.110     0.00
SmInt H89                                      Sm. Int. 2     0.00      0.00
Sto 264S                                       Stomach 1      1.04      5.98
Sto 288S                                       Stomach 2      5.10      0.00
Sto 115S                                       Stomach 3      5.03      0.75
Thr 143N                                       Thyroid 1      0.00      0.94
Thr 145T                                       Thyroid 2      1.89      2.50
Thr 939T                                       Thyroid 3      1.52      0.00
Tst 647T                                       Testis 1       10.20     0.00
Tst 39X                                        Testis 2       8.20      0.00
Tst 663T                                       Testis 3       5.09      0.00
Utr 141XO                                      Uterus 1       13.18     5.65
Utr 135XO                                      Uterus 2       1.47      1.36

0 = Negative



In the analysis of matching samples, higher expression
of Lng114 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 2 shows overexpression of
Lng114 in 11 lung cancer tissues compared with their
respective normal adjacent tissue in 35 cancer matching
pairs (lung samples #1, 2, 7, 10, 11, 12, 15, 16, 26, 28,
and 32). There is overexpression in the cancer tissue for
31% of the lung matching samples tested (total of 35
primary cancer lung matching samples).

WO 02/18576 PCT/US01/26684

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 31% of the lung matching samples
tested are believed to make Lng114 a good marker for
diagnosing, monitoring, staging, imaging and treating lung
cancer.

Primers Used for QPCR Expression Analysis

Forward primer
CTTGGCAGCTCACATGGAAC (SEQ ID NO: 30)
Reverse primer
CTGGGGTGTCTCTGTCACTCTC (SEQ ID NO: 31)
Probe
CCATGAAGTCCCACCCCTTTTCTCTG (SEQ ID NO: 32)

Example 5
Sequence 5
Lng118

Gene ID 210995

Table 1. The absolute numbers are relative levels of
expression of Lng118 in 24 different normal tissues. These
RNA samples are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals.



Tissue            NORMAL
Adrenal Gland     0
Bladder           0
Brain             0.010
Cervix            0.010
Colon             0
Endometrium       0.010
Esophagus         0
Heart             0
Kidney            0.010
Liver             0
Lung              1.000
Mammary Gland     0.010
Muscle            0.0032
Ovary             0.005
Pancreas          0.005
Prostate          0.002
Rectum            0.004
Small Intestine   0
Spleen            0
Stomach           0.015
Testis            0.033
Thymus            0.001
Trachea           0.007
Uterus            0.005

0 = negative



The relative levels of expression in Table 1 show that
Lng118 mRNA expression is high in lung compared with other
normal tissues analyzed.

The absolute numbers in Table 1 were obtained
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.



Table 2. The absolute numbers are relative levels of
expression of Lng118 in 36 pairs of matching samples. All
the values are compared to normal lung (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID        Cancer Type             Tissue         CANCER       MATCHING
Lng 60L          Adenocarcinoma          Lung 1         0.1          0.04
Lng AC66         Adenocarcinoma          Lung 2         0            0.27
Lng AC69         Adenocarcinoma          Lung 3         0            0.11
Lng AC88         Adenocarcinoma          Lung 4         0.05         0.13
Lng 60XL         Adenocarcinoma          Lung 5         0            0.08
Lng AC94         Adenocarcinoma          Lung 6         0            0
Lng AC11         Adenocarcinoma          Lung 7         0            0.02
Lng AC32         Adenocarcinoma          Lung 8         0            0.05
Lng 47XQ         Adenocarcinoma          Lung 9         0            0
Lng 223L         Adenocarcinoma          Lung 10        0            0.01
Lng BR26         Bronchio-alveolar       Lung 11        0            0
Lng SQ45         Squamous cell           Lung 12        0.54         0
Lng SQ16         Squamous cell           Lung 13        0            0
Lng SQ79         Squamous cell           Lung 14        0            0
Lng LC71         Large cell carcinoma    Lung 15        1.23         0.06
Lng LC109        Large cell carcinoma    Lung 16        0            0.06
Lng 75XC         Metastatic from         Lung 17        0            0


Bld TR17                                 Bladder 1      0            0
Cvx KS52                                 Cervix 1       0            0
Cln SG45                                 Colon 1        0            0
End 10479                                Colon 2        0            0
Kid 106XD                                Endometrium 1  0            0
                                         Kidney 1       0            0.01
[illegible]                              Kidney 2       0            0
                                         Liver 1        0            0
Mam [illegible]                          Liver 2        0            0
Ovr AO84                                 Mammary 1      [illegible]  [illegible]
Pan 71XL                                 Ovary 1        0.14         0
Pro 20XB                                 Pancreas 1     0            0
Pro 326                                  Prostate 1     0.02         0
SmInt H89                                Prostate 2     0            0
Sto 531S                                 Small          0            0
Tst 39X                                  Stomach 1      0            0
Thr 270T                                 Testis 1       0            0
Thr 644T                                 Thyroid 1      0.02         0.01

0 = Negative



In the analysis of matching samples, higher expression
of Lng118 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 2 shows differential
expression of Lng118 in 17 lung cancer tissues compared
with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested are believed to make Lng118 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lngll8 

Sequence available from Incyte database. 

Primers Used for QPCR Expression Analysis

Forward primer
TGCAGCAGAAAGGGGAGAG (SEQ ID NO: 33)
Reverse primer
TCCCCATTGCCCTCAAGT (SEQ ID NO: 34)
Probe
CGTGGGCACTCACCTCGGCACT (SEQ ID NO: 35)



Example 6
Sequence 6
Lng121

Gene ID 208994

Table 1. The absolute numbers are relative levels of
expression of Lng121 in 24 different normal tissues. All
the values are compared to normal trachea (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue            NORMAL
Adrenal Gland     0.01
Bladder           0.00
Brain             0.55
Cervix            0.09
Colon             0.02
Endometrium       1.74
Esophagus         0.08
Heart             0.00
Kidney            0.04
Liver             0.00
Lung              117.38
Mammary Gland     0.47
Muscle            0.36
Ovary             0.41
Pancreas          0.10
Prostate          0.93
Rectum            0.05
Small Intestine   0.09
Spleen            1.72
Stomach           0.12
Testis            3.24
Thymus            2.06
Trachea           1.00
Uterus            0.12

0 = negative



The relative levels of expression in Table 1 show that
Lng121 mRNA expression is high in lung compared with most
other normal tissues analyzed.

The absolute numbers in Table 1 were obtained
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng121 in 20 pairs of matching samples. All
the values are compared to normal trachea (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type               Tissue         CANCER  MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma            Lung 1         9.95    16.62
Lng 143L    Adenocarcinoma            Lung 2         1.07    5.41
Lng 60XL    Adenocarcinoma            Lung 3         12.77   5.08
Lng AC88    Adenocarcinoma            Lung 4         5.06    31.89
Lng AC66    Adenocarcinoma            Lung 5         3.85    22.32
Lng AC32    Adenocarcinoma            Lung 6         8.46    87.12
Lng 223L    Adenocarcinoma            Lung 7         1.87    4.10
Lng SQ14    Squamous cell carcinoma   Lung 8         2.91    33.72
Lng C20X    Squamous cell carcinoma   Lung 9         0.08    0.29
Lng 77L     Large cell carcinoma      Lung 10        8.13    16.35
Lng LC71    Large cell carcinoma      Lung 11        47.84   3.69
Lng 75XC    Metastatic from melanoma  Lung 12        3.49    15.67
Cln AS43                              Colon 1        1.22    0.17
Endo 12XA                             Endometrium 1  2.38    0.29
Kid 107XD                             Kidney 1       0.44    0.17
Liv 187L                              Liver 1        0.03    1.06
Mam 19DN                              Mammary 1      1.41    0.58
Ovr A084                              Ovary 1        0.76    0.28
Pro 109XB                             Prostate 1     0.19    0.27
Tst 647T                              Testis 1       2.92    1.64

0 = Negative



In the analysis of matching samples, higher expression
of Lng121 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent). Table 2 shows differential
expression of Lng121 in 12 lung cancer tissues compared with
their respective normal adjacent tissue.
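
The direction of the difference in each lung pair can be tallied from the Table 2 values. The sketch below is illustrative only: it uses a plain greater-than/less-than comparison with no fold-change threshold, which is an assumption and not a criterion stated in the text. The normal-adjacent values for lung pairs 10 and 12 are read here as 16.35 and 15.67.

```python
from collections import Counter

# (cancer, matching normal adjacent) values for the 12 lung pairs in Table 2.
lung_pairs = {
    1: (9.95, 16.62), 2: (1.07, 5.41), 3: (12.77, 5.08), 4: (5.06, 31.89),
    5: (3.85, 22.32), 6: (8.46, 87.12), 7: (1.87, 4.10), 8: (2.91, 33.72),
    9: (0.08, 0.29), 10: (8.13, 16.35), 11: (47.84, 3.69), 12: (3.49, 15.67),
}


def direction(cancer, normal):
    """Classify one matching pair by the sign of the cancer/normal difference."""
    if cancer > normal:
        return "higher in cancer"
    if cancer < normal:
        return "lower in cancer"
    return "no difference"


# Count how many pairs fall into each class.
counts = Counter(direction(c, n) for c, n in lung_pairs.values())
for label, count in counts.items():
    print(f"{label}: {count}")
```

With these values every lung pair differs between cancer and normal adjacent tissue, and most of the differences are in the direction of lower expression in the cancer sample.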

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested are believed to make Lng121 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lngl21 

Sequence available from Incyte database. 

Primers Used for QPCR Expression Analysis

Forward primer
CAGGCTCATTTTATTGTGGTCAT (SEQ ID NO: 36)
Reverse primer
CCCACACTGATTTAGGCACATAG (SEQ ID NO: 37)
Probe
TTTGAAGGAGGGCAGGAAAAACTATGTAAG (SEQ ID NO: 38)



Example 7
Sequence 7
Lng124

Gene ID 1066498

Table 1. The absolute numbers are relative levels of
expression of Lng124 in 24 different normal tissues. All
the values are compared to normal lung (calibrator). These
RNA samples are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals.



Tissue            NORMAL
Adrenal Gland     0
Bladder           0
Brain             0
Cervix            0
Colon             0
Endometrium       0
Esophagus         0
Heart             0
Kidney            0
Liver             0
Lung              1.00
Mammary Gland     0
Muscle            0
Ovary             0
Pancreas          0
Prostate          0
Rectum            0
Small Intestine   0
Spleen            0
Stomach           0
Testis            0
Thymus            0
Trachea           0
Uterus            0

0 = negative

The relative levels of expression in Table 1 show that
Lng124 mRNA expression is only detectable in lung among the
normal tissues analyzed.

The absolute numbers in Table 1 were obtained
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng124 in 40 pairs of matching samples. All
the values are compared to normal lung (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type               Tissue         CANCER  MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma            Lung 1         0.64
Lng 143L    Adenocarcinoma            Lung 2         0.03    0.26
Lng AC66    Adenocarcinoma            Lung 3         0.04    0.51
Lng 60XL    Adenocarcinoma            Lung 4         0.04    0.36
Lng AC88    Adenocarcinoma            Lung 5         0.11    1.05
Lng AC11    Adenocarcinoma            Lung 6         0.12    2.48
Lng AC32    Adenocarcinoma            Lung 7         0.22    0.64
Lng 47XQ    Adenocarcinoma            Lung 8         0.19    0.13
Lng AC39    Adenocarcinoma            Lung 9         0.16    0.7
Lng AC90    Adenocarcinoma            Lung 10        0.1     0.12
Lng 223L    Adenocarcinoma            Lung 11        0.05    0.17
Lng SQ14    Squamous cell carcinoma   Lung 12        0       0.69
Lng SQ9X    Squamous cell carcinoma   Lung 13        0.15    0.12
Lng SQ16    Squamous cell carcinoma   Lung 14        0.12    0.24
Lng SQ79    Squamous cell carcinoma   Lung 15        0.1     0.42
Lng SQ43    Squamous cell carcinoma   Lung 16        0.14    0.17
Lng BR94    Squamous cell carcinoma   Lung 17        0.01    0.03
Lng C20X    Squamous cell carcinoma   Lung 18        0       0.02
Lng LC109   Large cell carcinoma      Lung 19        0.06    0.74
Bld 66X                               Bladder 1      0       0
Cvx NK23                              Cervix 1       0       0
Cvx NK24                              Cervix 2       0       0
Cln AS45                              Colon 2        0       0
Cln RC24                              Colon 3        0       0
End 8911                              Endometrium 1  0       0
Kid 6XD                               Kidney 1       0       0
Kid 710K                              Kidney 2       0       0
Liv 94XA                              Liver 1        0       0
Mam 173M                              Mammary 2      0       0
Mam S123                              Mammary 3      0       0
Ovr AO82                              Ovary 1        0       0
Ovr C179                              Ovary 2        0       0
Ovr 130X                              Ovary 3        0
Pan 92X                               Pancreas 1     0       0
Pro J4B                               Prostate 1     0       0
Sto 531S                              Stomach 1      0       0
Sto AC93                              Stomach 2      0       0
Sto 288S                              Stomach 3      0       0
Sto TA73                              Stomach 4      0       0
Sto 288S                              Stomach 5      0       0
Sto 531S                              Stomach 6      0       0
Skn 287S                              Skin 1         0       0
Thr 692T                              Thyroid 1      0.02    0

0 = Negative






In the analysis of matching samples, expression of Lng124 is
detected only in lung samples (except for one thyroid cancer
sample), showing a high degree of tissue specificity for lung
tissue. These results confirm the tissue specificity results
obtained with normal pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng124 in all 19 lung cancer tissues
compared with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in all of the lung
matching samples tested, are believed to make Lng124 a good
marker for diagnosing, monitoring, staging, imaging and
treating lung cancer.

Primers Used for QPCR Expression Analysis
Forward primer

AGGGAGAGGAGCTATGGACGT (SEQ ID NO: 39)
Reverse primer

TTTTGAGGCAAGACTCCATCTC (SEQ ID NO: 40)
Probe

CTGCCAAGGGAGAGAGTGAGGTAGGC (SEQ ID NO: 41)

Example 8
Sequence 8
Lng126

Gene ID 287016

Table 1. The absolute numbers are relative levels of
expression of Lng126 in 24 different normal tissues. All
the values are compared to normal thymus (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.





Tissue             NORMAL
Adrenal Gland      11.92
Bladder            0.0
Brain              0.21
Cervix             0.7
Colon              0.06
Endometrium        6.36
Esophagus          0.04
Heart              0.06
Kidney             1.11
Liver              7.94
Lung               6.2
Mammary Gland      7.46
Muscle             0.78
Ovary              38.32
Pancreas           2.69
Prostate           5.21
Rectum             2.72
Small Intestine    0.6
Spleen             0.16
Stomach            0.93
Testis             3.2
Thymus             1.00
Trachea            4.61
Uterus             3.90

0 = negative

The relative levels of expression in Table 1 show that
Lng126 mRNA expression is relatively high in lung compared
with most other normal tissues analyzed, with the exception
of adrenal gland and ovary.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
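As an illustration only, the Table 1 values for Lng126 can be ranked to see where lung falls among the pooled normal tissues. The sketch below is not part of the described method; the function names are hypothetical, and "Small Intestine" is an assumed expansion of the truncated label "Small" in the table.

```python
# Relative Lng126 expression in pooled normal tissues, transcribed from
# Table 1 (thymus is the calibrator, so its value is 1.00).
# "Small Intestine" is an assumed expansion of the truncated label "Small".
TABLE1_LNG126 = {
    "Adrenal Gland": 11.92, "Bladder": 0.0, "Brain": 0.21, "Cervix": 0.7,
    "Colon": 0.06, "Endometrium": 6.36, "Esophagus": 0.04, "Heart": 0.06,
    "Kidney": 1.11, "Liver": 7.94, "Lung": 6.2, "Mammary Gland": 7.46,
    "Muscle": 0.78, "Ovary": 38.32, "Pancreas": 2.69, "Prostate": 5.21,
    "Rectum": 2.72, "Small Intestine": 0.6, "Spleen": 0.16, "Stomach": 0.93,
    "Testis": 3.2, "Thymus": 1.00, "Trachea": 4.61, "Uterus": 3.90,
}


def rank_tissues(levels):
    """Return (tissue, level) pairs sorted from highest to lowest level."""
    return sorted(levels.items(), key=lambda item: item[1], reverse=True)


def tissues_above(levels, reference):
    """Return, alphabetically, the tissues whose level exceeds the reference tissue."""
    ref_level = levels[reference]
    return sorted(t for t, v in levels.items() if v > ref_level)


if __name__ == "__main__":
    for tissue, level in rank_tissues(TABLE1_LNG126):
        print(f"{tissue:16s} {level:6.2f}")
    print("Above lung:", tissues_above(TABLE1_LNG126, "Lung"))
```

Because every value is expressed relative to the same calibrator, the ranking is meaningful within Table 1 only; as stated above, it should not be compared against the single-individual values of Table 2.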
Table 2. The absolute numbers are relative levels of
expression of Lng126 in 20 pairs of matching samples. All
the values are compared to normal thymus (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type               Tissue       CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma            Lung 1       0.06     0.03
Lng 143L    Adenocarcinoma            Lung 2       7.34     0.45
Lng AC66    Adenocarcinoma            Lung 3       0.10     0.07
Lng AC69    Adenocarcinoma            Lung 4       0.33     0.04
Lng AC11    Adenocarcinoma            Lung 5       0.72     0.30
Lng AC32    Adenocarcinoma            Lung 6       0.14     0.10
Lng AC94    Adenocarcinoma            Lung 7       0.11     0.01
Lng 223L    Adenocarcinoma            Lung 8       0.01     0.01
Lng SQ45    Squamous cell carcinoma   Lung 9       0.43     0.16
Lng SQ14    Squamous cell carcinoma   Lung 10      11.35    2.61
Lng SQ16    Squamous cell carcinoma   Lung 11      0.09     0.01
Lng SQ79    Squamous cell carcinoma   Lung 12      10.78    0.14
Lng C20X    Squamous cell carcinoma   Lung 13      0.26     0.00
Lng 77L     Large cell carcinoma      Lung 14      1.32     7.14
Bld 66X                               Bladder 1    4.92     43.56
Cln AS45                              Colon 1      1.26     1.28
Mam 19DN                              Mammary 1    14.62    0.48
Mam 220                               Mammary 2    0.33     0.61
Mam S854                              Mammary 3    0.66     1.04

0 = Negative
In the analysis of matching samples, higher expression
of Lng126 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng126 in 14 lung cancer tissues compared
with their respective normal adjacent tissue.
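The comparison of each cancer sample with its matching normal adjacent tissue can be sketched as a simple tally. This is an illustrative sketch rather than the analysis used in the application; it uses the 14 lung pairs listed for Lng126 in Table 2, and the function name and the three category labels are assumptions introduced here.

```python
from collections import Counter

# (cancer, matching normal adjacent) values for the 14 lung pairs of
# Lng126, transcribed from Table 2; keys are the "Lung N" numbers.
LNG126_LUNG_PAIRS = {
    1: (0.06, 0.03), 2: (7.34, 0.45), 3: (0.10, 0.07), 4: (0.33, 0.04),
    5: (0.72, 0.30), 6: (0.14, 0.10), 7: (0.11, 0.01), 8: (0.01, 0.01),
    9: (0.43, 0.16), 10: (11.35, 2.61), 11: (0.09, 0.01),
    12: (10.78, 0.14), 13: (0.26, 0.00), 14: (1.32, 7.14),
}


def classify_pair(cancer, normal):
    """Label a matching pair by the direction of the expression difference."""
    if cancer > normal:
        return "higher in cancer"
    if cancer < normal:
        return "lower in cancer"
    return "equal"


def tally_pairs(pairs):
    """Count how many pairs fall into each direction category."""
    return Counter(classify_pair(c, n) for c, n in pairs.values())


if __name__ == "__main__":
    for label, count in tally_pairs(LNG126_LUNG_PAIRS).items():
        print(f"{label}: {count} of {len(LNG126_LUNG_PAIRS)}")
```

A plain greater-than comparison is used here only for simplicity; no threshold for what counts as differential expression is stated in this section.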

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, are believed to make Lng126 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng126

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer

TGGGGACAATATGGACCTCA (SEQ ID NO: 42)
Reverse primer

GGCGAGTGTCTATGATGAACCT (SEQ ID NO: 43)
Probe

CAGGATCTGTGAGGATTTCATTTGGATACAT (SEQ ID NO: 44)
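For readers who wish to inspect the listed oligonucleotides, the following sketch computes length, GC fraction and reverse complement for the Lng126 primers and probe. It is an illustration only and not a procedure described in this application; the helper names are hypothetical, and no melting temperature or design criterion is implied.

```python
# Lng126 QPCR oligonucleotides as listed above (SEQ ID NO: 42-44).
OLIGOS = {
    "forward": "TGGGGACAATATGGACCTCA",
    "reverse": "GGCGAGTGTCTATGATGAACCT",
    "probe": "CAGGATCTGTGAGGATTTCATTTGGATACAT",
}

_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_fraction(seq):
    """Fraction of bases in the sequence that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq))


if __name__ == "__main__":
    for name, seq in OLIGOS.items():
        print(f"{name:8s} length={len(seq):2d} GC={gc_fraction(seq):.2f} "
              f"revcomp={reverse_complement(seq)}")
```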

Example 9
Sequence 9
Lng136

Gene ID 10717

ddx lung code SQLng001

Table 1. The absolute numbers are relative levels of
expression of Lng136 in 24 different normal tissues. All
the values are compared to normal spleen (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue             NORMAL
Adrenal Gland      0.34
Bladder            0.03
Brain              0.66
Cervix             0.12
Colon              0.00
Endometrium        0.08
Esophagus          0.05
Heart              0.02
Kidney             0.01
Liver              0.00
Lung               8.54
Mammary            1.32
Muscle             0.00
Ovary              0.07
Pancreas           0.86
Prostate           0.15
Rectum             0.02
Small Int.         0.05
Spleen             1.0
Stomach            0.77
Testis             1.22
Thymus             0.19
Trachea            0.16
Uterus             0.03

0 = negative





The relative levels of expression in Table 1 show that
Lng136 mRNA expression is high in lung compared with most
other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
Table 2. The absolute numbers are relative levels of
expression of Lng136 in 60 pairs of matching samples. All
the values are compared to normal spleen (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type   Tissue   CANCER   MATCHING NORMAL ADJACENT


Lng60L 


XlUCIiUUCll V—- X llvJllld 


Lung 


1 


5 


.92 


A T7 


Lng 


143L 


Adenocarcinonia 


Lung 


2 


2 


.20 


2 . 58 


Lng 


AC82 


Adenorarri noma 


Lung 


3 


2 


.62 


o • o o 


LngAC66 


Adenocarcinoma 


Lung 


A 


1 


.36 


3.61 


Lng 


60XL 




Lung 


5 


0 


.66 


A 


Lng 


AC88 


Adeno care inoma 


Lung 


g 


3 


.61 


15.94 


LngAC69 


Adeno care i noma 


Lung 


7 


7 


.49 


7 1 Q 


LngACll 


Adenocarcinoma 


Lung 


g 


1 


.31 


31.45 


LngAC32 


Adenocarcinoma 


Luncr 


9 


7 


.41 


Q CI 


Lng 


AC90 


Adenocarcinoma 


Lung 


10 


16 


.34 


5 . 26 


Lng223L 


Adenocarcinoma 


Luna 


11 


2 


.41 


1 .49 


LngAC94 


Adenocarcinoma 


Lung 


12 


1.47 


2 . 09 


Lng 


BR26 


Bronchio-alveolar 
carcinoma 


Lung 


13 




0 


6 . 5 


LngSQ45 


Squamous cell 


Luncr 


14 


19 


.97 


2 . 18 






carcinoma 












Lng 


SQ14 


Squamous cell 
carcinoma 


Lung 


15 


1 


.76 


13 .04 


Lng 


SQ56 


Squamous cell 
carcinoma 


Lung 


16 


2 


.18 


12.73 


LngSQ16 


Squamous cell 


Lung 


17 


0 


.54 


5.30 






carcinoma 












Lng 


SQ32 


Squamous cell 
carcinoma 


Lung 


18 


3 


.31 


14.17 


Lng 


AC3 9 


Squamous cell 
carcinoma 


Lung 


19 


3 


.43 


15.08 


Lng 


47XQ 


Squamous cell 
carcinoma 


Lung 


20 


0 


74 


5.3 


LngSQ79 


Squamous cell 


Lung 


21 


2 


.53 


8.49 






carcinoma 












Lng 


C20X 


Squamous cell 
carcinoma 


Lung 


22 


0 


07 


CI. 22 


Lng 


SQ44 


Squamous cell 
carcinoma 


Lung 


23 


1 


48 


3.59 


Lng 


SQ43 


Squamous cell 
carcinoma 


Lung 


24 


1 


45 


0.91 


Lng 


LC71 


Large cell carcinoma 


Lung 


25 


11. 


79 


10.67 


Lng 


77L 


Large cell carcinoma 


Lung 


26 


9. 


25 


3.11 


Lng 


LC109 Large cell carcinoma 


Lung 


27 


6. 


87 


36.89 


Lng 


MT67 


Metastatic from renal 


Lung 


28 


2. 


93 


5.01 






cell cancer 










Lng 


MT71 


Metastatic from 


Lung 


29 


0. 


19 


1.23 






melanoma 










BldTR14 




Bladder 1 


0. 


25 


0.94 



WO 02/18576 



PCT/US01/26684 






Cvx NK24 


Cervix 1 


0.46 


0.14 


Cvx KS52 






n 0*5 


ClnAS43 




0 09 


0 07 




Colon2 


u . xu 


n rtc 

U .UD 


ClnAS46 




n 1 c 


U . X J 




fol on 4 


0 04. 


n 14 


ClnAS89 




0 in 
u . xu 


U.JO 


End A911 


RnrfnmohT'i linn 

1 


n n£ 


n i fl 
u . xo 


T3nH OQY3V 
OX1U Z OAA 


X 

xshqotuc l. x x um 




0 • IlZ 


ViH cvn 

IvJ-U 3AU 


ivxaneyx 


ft r» 1 


1 . 41 


7i T AQVTl 
IvLQ 1U?ALI 


tu.aney 2 


0.47 


0 .39 


T ■! CV3S 

Lil V Xt> 


Liverl 


0 . 11 


0 . 03 


T.i v 1 TAT. 
iilv X /£Xi 


Liver 2 


0 . 01 


0 . 01 


Mam Q1 0*5 


Mammary 1 


0 . 19 


ft 1 *7 

0 . 17 


Mam 1 COY 


Mammary 2 


ft 1 c 

0 . 15 


0 . 17 


uvr L.X / y 


Ovary 1 


0 


ft 


vJVX X j U A 


Ovary 2 


0 . 58 


0 


Dan *71 YT 
rail / X AXi 


Pancreas 1 


0 . 07 


0 




Pancreas 2 


6 . 94 


1 . 62 


Fro jzo 


Prostate 1 


0 . 04 


0 - 12 


D-r*/-k 1 A. QYT5 
IT i O X U 7 AX3 


Prostate 2 


U . 01 


0 . 01 




oKin x 


U . 04 


0 . x 


Smlnt H89 


Small 


0.13 


0.04 




intestine 1 






Smlnt 


Small 


0.18 


0 


21XA 


intestine 2 






Sto TA73 


Stomach 1 


2.9 


4.18 


Sto 758S 


Stomach 2 


0.77 


1.53 


Tst647T 


Testis 1 


0.58 


0.38 


Tst 39X 


Testis 2 


0.50 


1.02 


Thr 270T 


Thyroid 1 


0.03 


0.02 


Utrl35XO 


Uterus 1 


0.2 


0.48 


0= Negative 



In the analysis of matching samples, higher expression
of Lng136 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng136 in 29 lung cancer tissues compared with
their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, are believed to make Lng136 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng136

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer

CTCCGTGGCTCGTGCTT (SEQ ID NO: 45)
Reverse primer

CGCTTTCTTTTTGCCCTCTTGT (SEQ ID NO: 46)

Example 10
Sequence 10
Lng143

Gene ID 24945

ddx lung code SQLng006

Table 1. The absolute numbers are relative levels of
expression of Lng143 in 24 different normal tissues. All
the values are compared to normal pancreas (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue             NORMAL
Adrenal Gland      0.83
Bladder            0.04
Brain              1.11
Cervix             0.20
Colon              0.01
Endometrium        2.49
Esophagus          0.01
Heart              0.09
Kidney             0.34
Liver              0.23
Lung               6.15
Mammary Gland      2.34
Muscle             0.44
Ovary              4.20
Pancreas           1.00
Prostate           6.34
Rectum             1.14
Small Intestine    0.16
Spleen             6.63
Stomach            1.13
Testis             3.12
Thymus             7.39
Trachea            2.77
Uterus             6.04

0 = negative

The relative levels of expression in Table 1 show that
Lng143 mRNA expression is much higher in lung compared
with most other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
Table 2. The absolute numbers are relative levels of
expression of Lng143 in 78 pairs of matching samples, 2
blood samples and 2 normal ovary samples. All the values
are compared to normal pancreas (calibrator). A matching
pair is formed by mRNA from the cancer sample for a
particular tissue and mRNA from the normal adjacent sample
for that same tissue from the same individual.



Sample ID   Cancer Type      Tissue     CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma   Lung 1     0.54     0.41
Lng 143     Adenocarcinoma   Lung 2     0.41     0.08
Lng 60XL    Adenocarcinoma   Lung 3     1.09     0.86
Lng AC82    Adenocarcinoma   Lung 4     3.25     0.09
Lng AC88    Adenocarcinoma   Lung 5     3.99     0.93
Lng AC66    Adenocarcinoma   Lung 6     0.99     0.42
Lng AC69    Adenocarcinoma   Lung 7     2.36     0.50
Lng AC11    Adenocarcinoma   Lung 8     2.67     1.80
Lng AC32    Adenocarcinoma   Lung 9     3.02     0.43
Lng AC39    Adenocarcinoma   Lung 10    9.35     0.13
Lng AC94    Adenocarcinoma   Lung 11    0.58     0.26
Lng AC90    Adenocarcinoma   Lung 12    3.85     0.01



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Lng 2231, 
ling 528L 
Lng BR26 

Lng BA641 

Lng SQ45 

Lng 315L 

Lng SQ14 

Lng SQ9X 

Lng SQ56 

Lng SQ80 

Lng SQ32 

Lng SQ16 

Lng SQ79 

Lng 90X 

Lng 47XQ 

Lng BR94 

Lng C20X 

Lng SQ44 

Lng SQ43 

Lng LC71 

Lng LC109 

Lng LC80 

Lng 77L 

Lng 75XC 

Lng MT71 

Lng MT67 

Bld46XK 
BldTR14 
Bio B5 
Bio B6 
Cvx KS52 
Cvx KS83 
ClnAS43 
ClnAS45 
ClnAS46 
ClnAS67 
ClnAS89 
End 10479 



Adenocarcinoma 
Adenocarcinoma 
Bronchogenic 
carcinoma 
Bronchio-alveolar 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Squamous cell 
carcinoma 
Large cell 
carcinoma 
Large cell 
carcinoma 
Large cell 
carcinoma 
Large cell 
carcinoma 
Metastatic from 
bone cancer 
Metastatic from 
renal cell cancer 
Metastatic from 
melanoma 



Lung 13 


0.32 


0.02 


Lung 14 


10,52 


3.77 


Lung 15 


8.40 


0.28 


Lung 16 


2.58 


0.37 


Lung 17 


4.61 


1.53 


Lung 18 


1.15 


1.16 


Lung 19 


1.83 


0.78 


Lung 20 


2.70 


0.14 


Lung 21 


2.50 


1.53 


Lung 22 


2.69 


0.77 


Lung 23 


7.70 


1.51 


Lung 24 


0.70 


0.04 


Lung 25 


3.61 


0.92 


Lung 26 


1.24 


0.23 


Lung 27 


1.90 


0.13 


Lung 28 


2.87 


0.00 


Lung 29 


0.05 


0.04 


Lung 30 


0.21 


2.13 


Lung 31 


2.86 


0.04 


Lung 32 


1.94 


1.82 


Lung 33 


4.04 


4.30 


Lung 34 








0 . 03 


1 . 08 


Lung 36 


0.15 


0.19 


Lung 37 


5.96 


0.74 


Lung 38 


12.30 


1.18 


Bladder 1 


0.03 


0.02 


Bladder 2 


2.89 


1.51 


Blood 1 






Blood 2* 






Cervix 1 


5.78 


1.44 


Cervix 2 


17.75 


4.29 


Colon 1 


3.42 


0.10 


Colon 2 


0.17 


0.13 


Colon 3 


2.29 


1.92 


Colon 4 


0.20 


0.33 


Colon 5 


0.08 


0.12 


Endometrium 


25.63 


4.63 



21.19 
41.21 






End 28XA 


Endomet r ium 


6.25 


2 .46 


End 68X 


2 

Endometrium 


6.43 


11.24 


KidlOXD 


3 

Kidney 1 


3.73 


1.07 


Kid 109XD 


Kidney 2 


2.90 


4.82 


LivlSXA 


Liver 1 


0.19 


0.08 


liv 174 L 


Liver 2 


0.99 


0.76 


Mara 173 M 


Mammary 1 


0.76 


0.47 


Mam 220 


Mammary 2 


0.11 


0.23 


Mam 355 


Mammary 3 


1.08 


0.19 


Mam 976M 


Mammary 4 


0.02 


0.16 


ovr 180B 


ovary 1 


16.11 




Ovr 18GA 


Ovary 2 


15 . 14 




Ovr A0R4 


uvaiy o 


ft 


C CO 


Pan 77X 


Da t"> f 3 ct 1 


QA 


O 1 
Z . DJ. 


Pan 92X 


.trails. Lccto ^ 


A Qft 


x . / u 


Pro 1(11 Y~R 


rrOoLaLc J. 


1 £ Q 


*5 "1 IT 

^ . lb 


Pro 10QXB 


T5>*^ o t* a ♦* a *5 
riuoLdtc ^ 


U . 10 


U . Z J 


Pro 


"D>™y"\ e a ^ a ^ 

riOStatc ,5 


n in 


U . LZ 


Pro 13XB 




c\ r\A 

U • Urfc 


U . jX 


O /VIA J J £\ 


Qlri n 1 

oKin x 


T 1 Q 

x . iy 


n no 


Skn 816S 


Qlri'n *? 

OlVXll ^ 


U ■ J3 


u . ux 


Smlnh 


Small 




n in 

u . 1U 


21XA 


lllUcbLlXxc X 






Smlnt H89 


Small 


0 . 66 


0 Afi 




Intestine 2 






Sto 115S 


Stomach 1 


1.91 


1.20 


Sto 264S 


Stomach 2 


0.74 


0.99 


Sto288S 


Stomach 3 


2.78 


0.06 


Tst647T 


Testis 1 


1.87 


2.68 


Tat 663T 


Testis 2 


7.89 


0.66 


Thr 270T 


Thyroid 1 


2.01 


2.13 


Thr 93 9T 


Thyroid 2 


0.50 


0.55 


Utrl35XO 


Uterus 1 


3.52 


6.06 


Utr 141X0 


Uterus 2 


2.59 


2.57 


0= Negative 



In the analysis of matching samples, higher expression
of Lng143 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng143 in 38 lung cancer tissues compared
with their respective normal adjacent tissue.
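One way to make the cancer-versus-normal comparison concrete is a per-pair fold change. The sketch below is an illustration under stated assumptions, not the method of the application: it uses only the first 12 lung pairs of Lng143 from Table 2, the function names are hypothetical, and the 2-fold cutoff is an arbitrary example value that does not come from the text.

```python
# (cancer, matching normal adjacent) values for Lung 1-12 of Lng143,
# transcribed from the first rows of Table 2.
LNG143_LUNG_PAIRS = {
    1: (0.54, 0.41), 2: (0.41, 0.08), 3: (1.09, 0.86), 4: (3.25, 0.09),
    5: (3.99, 0.93), 6: (0.99, 0.42), 7: (2.36, 0.50), 8: (2.67, 1.80),
    9: (3.02, 0.43), 10: (9.35, 0.13), 11: (0.58, 0.26), 12: (3.85, 0.01),
}


def fold_change(cancer, normal):
    """Cancer/normal ratio; a zero normal value gives inf (or None if both are zero)."""
    if normal == 0:
        return float("inf") if cancer > 0 else None
    return cancer / normal


def pairs_at_or_above(pairs, threshold):
    """Return the pair numbers whose fold change meets the threshold."""
    hits = []
    for number, (cancer, normal) in sorted(pairs.items()):
        ratio = fold_change(cancer, normal)
        if ratio is not None and ratio >= threshold:
            hits.append(number)
    return hits


if __name__ == "__main__":
    # The 2.0 cutoff is an assumption chosen for illustration only.
    print(pairs_at_or_above(LNG143_LUNG_PAIRS, 2.0))
```

The explicit handling of a zero denominator matters because several tables in this document report 0 (negative) for the normal adjacent sample.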
Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, are believed to make Lng143 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng143

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer

CCGACCTTGAGATTATTCCTGT (SEQ ID NO: 47)
Reverse primer

GCACCACTTAAACCAAATCCA (SEQ ID NO: 48)
Probe

TGCTGCCAACACCACTTCTCCATCT (SEQ ID NO: 49)

Example 11
Sequence 11
Lng144

Gene ID 52017

ddx lung code SQlng007

Table 1. The absolute numbers are relative levels of
expression of Lng144 in 24 different normal tissues. All
the values are compared to normal uterus (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue             NORMAL
Adrenal Gland      0.04
Bladder            1.29
Brain              0.44
Cervix             0.85
Colon              0.00
Endometrium        0.43
Esophagus          u . UD
Heart              U . Uo
Kidney             0.10
Liver              0.30
Lung               1 . ib
Mammary Gland      1.04
Muscle             0.34
Ovary
Pancreas           0.77
Prostate           0.93
Rectum             0.26
Small              0.11
Spleen             3.92
Stomach            0.30
Testis             1.1
Thymus             0.93
Trachea            0.69
Uterus             1.00

0 = negative



The relative levels of expression in Table 1 show that
Lng144 mRNA expression is high in lung compared with other
normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
Table 2. The absolute numbers are relative levels of
expression of Lng144 in 30 pairs of matching samples. All
the values are compared to normal uterus (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.





Sample ID   Cancer Type                   Tissue     CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma                Lung 1     0.65     0.30
Lng 143L    Adenocarcinoma                Lung 2     0.29     0.17
Lng AC66    Adenocarcinoma                Lung 3     0        0.34
Lng AC69    Adenocarcinoma                Lung 4     1.41     1.89
Lng AC11    Adenocarcinoma                Lung 5     2.82     3.11
Lng AC32    Adenocarcinoma                Lung 6     1.09     1.27
Lng AC94    Adenocarcinoma                Lung 7     2.20     0.84
Lng 223L    Adenocarcinoma                Lung 8     0.30     0.27
Lng BR26    Bronchio-alveolar carcinoma   Lung 9     1.25     0.36
Lng SQ45    Squamous cell carcinoma       Lung 10    2.96     1.26
Lng SQ9X    Squamous cell carcinoma       Lung 11    1.49     0.30
Lng SQ80    Squamous cell carcinoma       Lung 12    1.88     1.51
Lng SQ16    Squamous cell carcinoma       Lung 13    0.48     0.47
Lng SQ79    Squamous cell carcinoma       Lung 14    2.77     0.00
Lng 90X     Squamous cell carcinoma       Lung 15    0.09     0.29
Lng SQ43    Squamous cell carcinoma       Lung 16    0.81     0.26
Lng LC71    Large cell carcinoma          Lung 17
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cancer 










Bld 46XK                                  Bladder 1  0.24     0.08
Bld TR14                                  Bladder 2  0.18     0.95
Cln AS45                                  Colon 2    0.12     0.05
Cln AS46                                  Colon 3    0.21     0.98
Cln AS67                                  Colon 4    0.09     0.18
Cln AS89                                  Colon 5    0.38     3.31
Cln AS43                                  Colons     0.18     0.47
Liv 15XA                                  Liver 1    0.47     0.05
Tst 647T                                  Testis 1   2.10     0.26
Utr 135XO                                 Uterus 1   0.81     0.80

0= Negative 



In the analysis of matching samples, higher expression
of Lng144 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng144 in 20 lung cancer tissues compared
with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, are believed to make Lng144 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng144

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer

TGCTGCCACAAACCGAGA (SEQ ID NO: 50)
Reverse primer

TTGGGAGGGTTGGTTGGTT (SEQ ID NO: 51)
Probe

TTTTGAGGGCACTAGGGAACGATCTGT (SEQ ID NO: 52)

Example 12
Sequence 12
Lng138

Gene ID 460254

ddx lung code SQlngllO

Table 1. The absolute numbers are relative levels of
expression of Lng138 in 24 different normal tissues. All
the values are compared to normal spleen (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue             NORMAL
Adrenal Gland      0.00
Bladder            0.00
Brain              0.04
Cervix             0.09
Colon              0.00
Endometrium        0.31
0 = negative

The relative levels of expression in Table 1 show that
Lng138 mRNA expression is high in lung compared with other
normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
Table 2. The absolute numbers are relative levels of
expression of Lng138 in 50 pairs of matching samples. All
the values are compared to normal spleen (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.





Sample ID   Cancer Type      Tissue    CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma   Lung 1    1.66     0.82
Lng 143L    Adenocarcinoma   Lung 2    0.06     0.03
Lng AC82    Adenocarcinoma   Lung 3    0.02     0.03
Lng AC66    Adenocarcinoma   Lung 4    0.1      0.79
Lng AC69    Adenocarcinoma   Lung 5    0.56     0.65
Lng 60XL    Adenocarcinoma   Lung 6    0.02     0.15
Lng AC94    Adenocarcinoma   Lung 7    0.67     0.65
Lng AC11    Adenocarcinoma   Lung 8    0.41     3.32
Lng AC32    Adenocarcinoma   Lung 9    0.47     1.57
In the analysis of matching samples, higher expression
of Lng138 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng138 in 27 lung cancer tissues compared
with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, are believed to make Lng138 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng138

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer

CCTGATACCTTTAACCAATGCTCT (SEQ ID NO: 53)
Reverse primer

TTGGGTAGTATCAAATGGGTAAGG (SEQ ID NO: 54)
Probe

CCTGTCCTTCTCCTTTGGCTTATGCTATCC (SEQ ID NO: 55)

Example 13
Sequence 13
Lng137

Gene ID 179090

ddx lung code SQLng012

Table 1. The absolute numbers are relative levels of
expression of Lng137 in 24 different normal tissues. All
the values are compared to normal spleen (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.





Tissue             NORMAL
Adrenal Gland      0.042
Bladder            0.063
Brain              0.285
Cervix             u.iyo
Colon              0.080
Endometrium        0.956
Esophagus          0.025
Heart              0.010
Kidney             0.046
Liver              0.035
Lung               0.204
Mammary Gland      0.142
Muscle             0.092
Ovary              0.760
Pancreas           0.084
Prostate           0.355
Rectum             0.357
Small Intestine    0.074
Spleen             1.000
Stomach            0.103
Testis             2.612
Thymus             10.853
Trachea            0.076
Uterus             0.235

0 = negative



The relative levels of expression in Table 1 show that
Lng137 mRNA expression is relatively high in lung compared
with most other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
Table 2. The absolute numbers are relative levels of
expression of Lng137 in 70 pairs of matching samples. All
the values are compared to normal spleen (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type      Tissue     CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma   Lung 1     0.92     0.67
Lng 143L    Adenocarcinoma   Lung 2     0.53     0.04
Lng AC82    Adenocarcinoma   Lung 3     2.7      0
Lng AC66    Adenocarcinoma   Lung 4     1.62     0.34
Lng AC69    Adenocarcinoma   Lung 5     3.18     0.79
Lng AC88    Adenocarcinoma   Lung 6     1.87     0.32
Lng 60XL    Adenocarcinoma   Lung 7     3.42     0.24
Lng AC94    Adenocarcinoma   Lung 8     0        0.21
Lng AC11    Adenocarcinoma   Lung 9     23.43    2.76
Lng AC32    Adenocarcinoma   Lung 10    5.17     0.63
Lng 47XQ    Adenocarcinoma   Lung 11    2.03     0
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Adenocarcinoma 


Lung 


12 


4 


.69 


0 
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Adenocarcinoma 


Lung 


13 


2 


.48 


0.22 


Lng223L 


Adenocarcinoma 


Lung 


14 


1 


.89 


0 


Lng 52 8L 


Adenocarcinoma 


Lung 


15 


1 


.47 


0 


Lng BR26 


Bronchio-alveolar 


Lung 


16 


13 


.18 


0.47 


Lng BA641 


Squamous cell 


Lung 


17 


0 


.97 


0.18 


Lng 315L 


Squamous cell 


Lung 


18 


0 


.63 


0.62 
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Squamous cell 


Lung 


19 
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Squamous cell 
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20 


2 
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0.29 
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1 
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10 
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10 
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0 


.13 
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Lng 9 OX 


Squamous cell 


Lung 


31 


0 
In the analysis of matching samples, higher expression
of Lng137 is detected in lung samples, showing a high
degree of tissue specificity for lung tissue. These
results confirm the tissue specificity results obtained
with normal pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows overexpression of
Lng137 in 31 lung cancer tissues compared with their
respective normal adjacent tissue in 38 cancer matching
pairs (lung samples #2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13,
14, 15, 16, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 31,
33, 34, 35, 36, and 37). There is overexpression in the
cancer tissue for 82% of the lung matching samples tested
(total of 38 lung matching samples).
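
The percentage quoted above follows from a simple count over the matching pairs. The Python sketch below illustrates that arithmetic. It assumes that a pair counts as overexpressed whenever the cancer value exceeds the matched normal adjacent value, a criterion the text does not state explicitly. The function names and the `min_fold` parameter are hypothetical, and the three rows are illustrative values taken from Table 2.

```python
def is_overexpressed(cancer, normal, min_fold=1.0):
    """True when the cancer level exceeds min_fold times the matched normal level.

    min_fold is a hypothetical knob; the source text states no cutoff.
    """
    return cancer > min_fold * normal


def percent_overexpressed(n_over, n_pairs):
    """Percentage of matching pairs that show overexpression."""
    return 100.0 * n_over / n_pairs


# (lung sample number, cancer value, matching normal adjacent value)
rows = [(2, 0.53, 0.04), (8, 0.0, 0.21), (9, 23.43, 2.76)]

flagged = [sample for sample, cancer, normal in rows
           if is_overexpressed(cancer, normal)]
print(flagged)                                  # samples 2 and 9
print(round(percent_overexpressed(31, 38)))     # 31 of 38 pairs, about 82%
```

Raising `min_fold` above 1.0 would model a stricter cutoff than the simple comparison assumed here.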

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 82% of the lung matching samples
tested, is believed to make Lng137 a good marker for
diagnosing, monitoring, staging, imaging and treating lung
cancer.

Northern Analysis

One transcript - 3.4 kb

DNA sequence for Lng137

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer
CTCGGATATGATTAAAGAGTTTCG (SEQ ID NO: 56)
Reverse primer
TCCACTGTGCTGTTTGTTGTT (SEQ ID NO: 57)
Probe
ATTGGCGTGCTCTTTGTAACTCTGAGA (SEQ ID NO: 58)
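
As a quick sanity check on oligonucleotides such as those listed above, length and GC content can be computed directly from the sequence. The Python sketch below uses the three Lng137 sequences as given. The helper names are hypothetical, and the checks are generic ones, not part of the method described in this application.

```python
# Watson-Crick complements for unambiguous DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def gc_content(seq):
    """Percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


oligos = {
    "forward (SEQ ID NO: 56)": "CTCGGATATGATTAAAGAGTTTCG",
    "reverse (SEQ ID NO: 57)": "TCCACTGTGCTGTTTGTTGTT",
    "probe (SEQ ID NO: 58)": "ATTGGCGTGCTCTTTGTAACTCTGAGA",
}

for name, seq in oligos.items():
    print(f"{name}: {len(seq)} nt, GC {gc_content(seq):.1f}%")
```

The same two helpers apply unchanged to the primer and probe sets listed for the other genes.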

Example 14
Sequence 14
Lng142

Gene ID 6348

ddxlung code SQlng004

Table 1. The absolute numbers are relative levels of
expression of Lng142 in 24 different normal tissues. All
the values are compared to normal lung (calibrator). These
RNA samples are commercially available pools, originated by
pooling samples of a particular tissue from different
individuals.
Thymus    0.00
Trachea   0.01
Uterus    0.03

0 = negative

The relative levels of expression in Table 1 show that
Lng142 mRNA expression is high in lung compared with most
other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.
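
This section does not spell out how the relative levels are derived from the raw QPCR data. One common approach, assumed here purely for illustration, is the comparative Ct method, in which the calibrator tissue takes the value 1.00 by construction. In the Python sketch below, the function name, the use of a reference (housekeeping) gene and all Ct values are hypothetical.

```python
def relative_expression(ct_target, ct_reference, ct_target_cal, ct_reference_cal):
    """Relative expression by the comparative Ct (2^-ddCt) method.

    Assumes roughly 100% amplification efficiency for both the target
    and the reference gene, so each cycle corresponds to a doubling.
    """
    delta_sample = ct_target - ct_reference              # normalize the sample
    delta_calibrator = ct_target_cal - ct_reference_cal  # normalize the calibrator
    return 2.0 ** -(delta_sample - delta_calibrator)


# The calibrator compared with itself is 1.0 by construction.
print(relative_expression(25.0, 18.0, 25.0, 18.0))
# A sample whose target crosses threshold one cycle earlier is 2-fold higher.
print(relative_expression(24.0, 18.0, 25.0, 18.0))
```

Under this assumption every value in a table is a ratio to its own calibrator run, which is consistent with the caution above against comparing absolute numbers between Table 1 and Table 2.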

Table 2. The absolute numbers are relative levels of
expression of Lng142 in 20 pairs of matching samples. All
the values are compared to normal lung (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.
Sample ID     Cancer Type               Tissue      CANCER        MATCHING NORMAL ADJACENT
[illegible]   [illegible]               Lung 1      [illegible]   [illegible]
Lng AC66      Adenocarcinoma            Lung 2      0.00          0.05
[illegible]   Adenocarcinoma            Lung 3      [illegible]   [illegible]
[illegible]   Adenocarcinoma            Lung 4      [illegible]   [illegible]
[illegible]   Adenocarcinoma            Lung 5      [illegible]   [illegible]
[illegible]   Adenocarcinoma            Lung 6      [illegible]   0.00
[illegible]   Adenocarcinoma            Lung 7      [illegible]   [illegible]
[illegible]   Squamous cell carcinoma   Lung 8      [illegible]   0.02
[illegible]   Squamous cell carcinoma   Lung 9      [illegible]   0.00
[illegible]   Squamous cell carcinoma   Lung 10     [illegible]   0.11
Bld 46XK                                Bladder 1   0.00          0.00
Bld TR14                                Bladder 2   0.00          0.00
Cln AS45                                Colon 1     0.00          0.00
Cln AS46                                Colon 2     0.00          0.00
Cln AS67                                Colon 3     0.00          0.01
Cln AS89                                Colon 4     0.01          0.02
Cln AS43                                Colon 5     0.00          0.00
Liv 15XA                                Liver 1     0.00          0.00
Tst 647T                                Testis 1    0.00          0.05
Utr 135XO                               Uterus 1    0.17          0.00

0 = Negative



In the analysis of matching samples, higher expression
of Lng142 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng142 in 10 lung cancer tissues compared
with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, is believed to make Lng142 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng142

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer
TGGCTAAAATAGGTCTTGTAGGGA (SEQ ID NO: 59)
Reverse primer
CAAGGAGGGGGCATTTGTA (SEQ ID NO: 60)
Probe
TCCTTTCCTTGGCAATCTCCTCTCCTG (SEQ ID NO: 61)

Example 15
Sequence 15
Lng140

Gene ID 94694

ddx lung code SQLngOOS

Table 1. The absolute numbers are relative levels of
expression of Lng140 in 24 different normal tissues. All
the values are compared to normal mammary gland
(calibrator). These RNA samples are commercially available
pools, originated by pooling samples of a particular tissue
from different individuals.



Tissue            NORMAL
Adrenal Gland     0
Bladder           0.00
Brain             0.00
Cervix            0.00
Colon             0.00
Endometrium       0.14
Esophagus         0.00
Heart             0.00
Kidney            0.00
Liver             0.00
Lung              183.55
Mammary Gland     1.00
Muscle            0.00
Ovary             0.00
Pancreas          0.00
Prostate          0.10
Rectum            0.06
Small Intestine   0.03
Spleen            0.00
Stomach           0.02
Testis            0.01
Thymus            0.00
Trachea           3.72
Uterus            0.00

0 = negative

The relative levels of expression in Table 1 show that
Lng140 mRNA expression is much higher in lung compared
with most other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng140 in 78 pairs of matching samples, 2
blood samples, 1 normal ovary and 1 cancer ovary sample.
All the values are compared to normal mammary gland
(calibrator). A matching pair is formed by mRNA from the
cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same
individual.



Sample ID     Cancer Type                         Tissue              CANCER             MATCHING NORMAL ADJACENT
Lng 60L       Adenocarcinoma                      Lung 1              21.56              3.54
Lng 143L      Adenocarcinoma                      Lung 2              0.00               7.31
Lng 60XL      Adenocarcinoma                      Lung 3              88.03              19.84
Lng AC82      Adenocarcinoma                      Lung 4              4.61               122.36
Lng AC88      Adenocarcinoma                      Lung 5              4.61               70.77
Lng AC66      Adenocarcinoma                      Lung 6              38.19              7.46
Lng AC69      Adenocarcinoma                      Lung 7              36.00              25.99
[illegible]   Adenocarcinoma                      Lung 8              1287.[illegible]   280.14
Lng AC32      Adenocarcinoma                      Lung 9              [illegible]        0.[illegible]
Lng AC94      Adenocarcinoma                      Lung 10             39.81              153.28
Lng AC90      Adenocarcinoma                      Lung 11             78.25              420.22
Lng AC39      Adenocarcinoma                      Lung 12             600.49             5.17
Lng 223L      Adenocarcinoma                      Lung 13             5.60               3.56
Lng 528L      Adenocarcinoma                      Lung 14             6.17               28.05
Lng BR26      Bronchogenic carcinoma              Lung 15             4.68               23.18
Lng BA641     Bronchio-alveolar carcinoma         Lung 16             263.20             4.11
Lng 315L      Squamous cell carcinoma             Lung 17             0.00               3.77
Lng SQ14      Squamous cell carcinoma             Lung 18             0.74               5.60
Lng SQ56      Squamous cell carcinoma             Lung 19             36.25              186.75
Lng SQ9X      Squamous cell carcinoma             Lung 20             77.98              1.99
Lng SQ80      Squamous cell carcinoma             Lung 21             20.32              35.02
Lng SQ45      Squamous cell carcinoma             Lung 22             153.28             80.73
Lng SQ16      Squamous cell                       Lung 23             9.45               13.04
Lng SQ32      Squamous cell carcinoma             Lung 24             3213.66            99.04
Lng SQ79      Squamous cell carcinoma             Lung 25             594.28             48.17
Lng 47XQ      Squamous cell carcinoma             Lung 26             47.84              0.00
Lng BR94      Squamous cell carcinoma             Lung 27             4.66               0.00
Lng 90X       Squamous cell carcinoma             Lung 28             0.00               6.41
Lng C20X      Squamous cell carcinoma             Lung 29             2.35               0.00
Lng SQ44      Squamous cell carcinoma             Lung 30             6.59               1.55
Lng SQ43      Squamous cell carcinoma             Lung 31             25.19              0.00
Lng LC71      Large cell carcinoma                Lung 32             1408.55            97.01
Lng LC109     Large cell carcinoma                Lung 33             85.92              922.88
Lng LC80      Large cell carcinoma                Lung 34             99.39              11.16
Lng 77L       Large cell carcinoma                Lung 35             8.69               11.35
Lng 75XC      Metastatic from bone cancer         Lung 36             0.00               0.00
Lng MT67      Metastatic from renal cell cancer   Lung 37             0.00               2.28
Lng MT71      Metastatic from melanoma            Lung 38             1.56               0.00
Bld 46XK                                          Bladder 1           0.00               0.00
Bld TR14                                          Bladder 2           0.00               168.90
Blo B5                                            Blood 1
Blo B6                                            Blood 2
Cvx KS52                                          Cervix 1            85.33              0.00
Cvx KS83                                          Cervix 2            23.51              0.00
Cln AS43                                          Colon 1             259.57             0.00
Cln AS45                                          Colon 2             0.00               0.00
Cln AS46                                          Colon 3             14.52              0.00
Cln AS67                                          Colon 4             5.90               41.64
Cln AS89                                          Colon 5             5.13               6.54
End 10479                                         Endometrium 1       0.00               0.00
End 28XA                                          Endometrium 2       38.45              1.29
End 68X                                           Endometrium 3       0.00               2.30
Kid 10XD                                          Kidney 1            0.00               0.00
Kid 109XD                                         Kidney 2            0.00               0.00
Liv 15XA                                          Liver 1             0.00               0.00
Liv 174L                                          Liver 2             0.00               0.00
Mam 173M                                          Mammary 1           0.87               0.00
Mam 220                                           Mammary 2           0.00               0.00
Mam 355                                           Mammary 3           0.00               0.00
Mam 976M                                          Mammary 4           0.00               0.00
Ovr 180B                                          Ovary 1             0.00               0.00
Ovr 18GA                                          Ovary 2                                0.00
Ovr AO84                                          Ovary 3             36.50
Pan 77X                                           Pancreas 1          0.00               0.00
Pan 92X                                           Pancreas 2          46.53              0.00
Pro 101XB                                         Prostate 1          0.29               1.43
Pro 109XB                                         Prostate 2          0.00               0.00
Pro 125XB                                         Prostate 3          1.30               1.97
Pro 13XB                                          Prostate 4          0.00               0.00
Skn 39A                                           Skin 1              0.00               0.00
Skn 816S                                          Skin 2              0.00               0.00
SmInt 21XA                                        Small Intestine 1   1.45               1.70
SmInt H89                                         Small Intestine 2   79.07              1.[illegible]
Sto 115S                                          Stomach 1           109.14             10.16
Sto 264S                                          Stomach 2           2.53               0.00
Sto 288S                                          Stomach 3           0.00               0.00
Tst 647T                                          Testis 1            0.00               0.00
Tst 663T                                          Testis 2            0.00               0.00
Thr 270T                                          Thyroid 1           0.00               0.00
Thr 939T                                          Thyroid 2           0.00               0.00
Utr 135XO                                         Uterus 1            3.89               0.00
Utr 141XO                                         Uterus 2            0.29               1.43

0 = Negative

In the analysis of matching samples, a higher
expression level of Lng140 is detected in lung samples,
showing a high degree of tissue specificity for lung
tissue. These results confirm the tissue specificity
results obtained with normal pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows overexpression of
Lng140 in 20 lung cancer tissues compared with their
respective normal adjacent tissue in 38 cancer matching
pairs (lung samples #1, 3, 6, 8, 9, 12, 13, 16, 20, 22,
24-27, 29-32, 34, and 38). There is overexpression in the
cancer tissue for 53% of the lung matching samples tested
(total of 38 lung matching samples).

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 53% of the lung matching samples
tested, is believed to make Lng140 a good marker for
diagnosing, monitoring, staging, imaging and treating lung
cancer.

Primers Used for QPCR Expression Analysis
Forward primer
CCTCTGAAGAAACGATCACAACA (SEQ ID NO: 62)
Reverse primer
ATTCCAGCCTGAGTCACACAGA (SEQ ID NO: 63)
Probe
ACCAAGGAGAAACAAAACCAAGCAGCA (SEQ ID NO: 64)

Example 16
Sequence 16
Lng151

Gene ID 145812
ddxlung code SQlng008

Table 1. The absolute numbers are relative levels of
expression of Lng151 in 24 different normal tissues. All
the values are compared to normal thymus (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.



Tissue            NORMAL
Adrenal Gland     0.01
Bladder           0.01
Brain             0.03
Cervix            0.07
Colon             0.01
Endometrium       0.16
Esophagus         0.02
Heart             0.00
Kidney            0.01
Liver             0.00
Lung              0.17
Mammary Gland     0.06
Muscle            0.04
Ovary             0.44
Pancreas          0.05
Prostate          0.04
Rectum            0.03
Small Intestine   0.01
Spleen            0.13
Stomach           0.02
Testis            0.03
Thymus            1.00
Trachea           0.09
Uterus            0.09

0 = negative



The relative levels of expression in Table 1 show that
Lng151 mRNA expression is high in lung compared with most
other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng151 in 20 pairs of matching samples. All
the values are compared to normal thymus (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type               Tissue      CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma            Lung 1      0.35     0.18
Lng AC66    Adenocarcinoma            Lung 2      0.37     0.34
Lng AC69    Adenocarcinoma            Lung 3      1.99     0.32
Lng AC11    Adenocarcinoma            Lung 4      1.13     1.11
Lng AC32    Adenocarcinoma            Lung 5      0.75     0.23
Lng AC94    Adenocarcinoma            Lung 6      0.2      0.1
Lng 223L    Adenocarcinoma            Lung 7      0.06     0
Lng SQ45    Squamous cell carcinoma   Lung 8      2.45     0.94
Lng SQ16    Squamous cell carcinoma   Lung 9      0.18     0.05
Lng SQ79    Squamous cell carcinoma   Lung 10     1.23     0.62
Bld 46XK                              Bladder 1   0.06     0
Bld TR14                              Bladder 2   0.27     0.37
Cln AS43                              Colon 5     0.27     0.04
Cln AS45                              Colon 1     0.02     0.04
Cln AS46                              Colon 2     0.04     0.15
Cln AS67                              Colon 3     0.03     0.28
Cln AS89                              Colon 4     0.05     0.32
Liv 15XA                              Liver 1     0.22     0.08
Tst 647T                              Testis 1    0.26     0.21
Utr 135XO                             Uterus 1    1.14     1.06

0 = Negative

In the analysis of matching samples, higher expression
of Lng151 is detected in lung samples, showing a high degree
of tissue specificity for lung tissue. These results
confirm the tissue specificity results obtained with normal
pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng151 in 10 lung cancer tissues compared
with their respective normal adjacent tissue.
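
To make the cancer versus normal adjacent comparison concrete, the fold change for each matching pair can be computed from the two value columns of Table 2. The Python sketch below uses five of the lung rows. The helper name, the handling of a zero normal value and the two-fold cutoff are illustrative assumptions, not criteria stated in this application.

```python
import math


def fold_change(cancer, normal):
    """Ratio of the cancer level to the matched normal adjacent level.

    A normal value of 0 (reported as negative) gives an infinite fold
    change when the cancer value is positive, and 1.0 (no change) when
    both values are 0. This convention is an assumption of the sketch.
    """
    if normal == 0:
        return math.inf if cancer > 0 else 1.0
    return cancer / normal


# (lung sample number, cancer value, matching normal adjacent value)
pairs = [
    (1, 0.35, 0.18),
    (3, 1.99, 0.32),
    (4, 1.13, 1.11),
    (7, 0.06, 0.0),
    (8, 2.45, 0.94),
]

CUTOFF = 2.0  # hypothetical threshold for calling a pair differential
differential = [sample for sample, cancer, normal in pairs
                if fold_change(cancer, normal) >= CUTOFF]
print(differential)  # samples 3, 7 and 8 under this cutoff
```

Pairs such as lung sample 4 (1.13 versus 1.11) show why some cutoff is useful: the two values are nearly identical even though the cancer value is nominally higher.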

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, is believed to make Lng151 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng151

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer
TGAGGAGAAAGAAGGGAATCAC (SEQ ID NO: 65)
Reverse primer
TCCTAAGGTAGCACTATTTGGAGAC (SEQ ID NO: 66)
Probe
AGCAATGAAGAATGAACTTGGAGTAAAGAGTCA (SEQ ID NO: 67)

Example 17
Sequence 17
Lng150

Gene ID 10713

ddx lung code SQlng002

Table 1. The absolute numbers are relative levels of
expression of Lng150 in 24 different normal tissues. All
the values are compared to normal testis (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.





Tissue            NORMAL
Adrenal Gland     0.00
Bladder           0.04
Brain             0.01
Cervix            0.00
Colon             0.00
Endometrium       [illegible]
Esophagus         0.00
Heart             0.00
Kidney            0.01
Liver             0.00
Lung              0.01
Mammary Gland     0.00
Muscle            0.00
Ovary             0.00
Pancreas          0.00
Prostate          0.0[illegible]
Rectum            0.00
Small Intestine   0.00
Spleen            0.00
Stomach           0.00
Testis            1.00
Thymus            0.00
Trachea           0.01
Uterus            0.07

0 = negative





The relative levels of expression in Table 1 show that
Lng150 mRNA expression is detected in lung and is not
detectable in adrenal gland, cervix, colon, esophagus,
heart, liver, mammary gland, muscle, ovary, pancreas,
rectum, small intestine, spleen, stomach, and thymus.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng150 in 40 pairs of matching samples. All
the values are compared to normal testis (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type                   Tissue          CANCER        MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma                Lung 1          0.00          0.01
Lng AC88    Adenocarcinoma                Lung 2          0.02          0.00
Lng AC66    Adenocarcinoma                Lung 3          0.00          0.24
Lng AC69    Adenocarcinoma                Lung 4          0.02          0.00
Lng AC11    Adenocarcinoma                Lung 5          0.00          0.20
Lng AC32    Adenocarcinoma                Lung 6          0.04          0.00
Lng AC39    Adenocarcinoma                Lung 7          0.00          0.00
Lng AC94    Adenocarcinoma                Lung 8          0.00          0.00
Lng AC90    Adenocarcinoma                Lung 9          0.00          0.00
Lng 223L    Adenocarcinoma                Lung 10         0.00          0.00
Lng BR26    Bronchio-alveolar carcinoma   Lung 11         0.02          0.02
Lng SQ45    Bronchogenic carcinoma        Lung 12         1.08          0.04
Lng SQ9X    Squamous cell carcinoma       Lung 13         0.00          0.00
Lng SQ80    Squamous cell carcinoma       Lung 14         [illegible]   [illegible]
Lng SQ16    Squamous cell carcinoma       Lung 15         0.48          0.00
Lng SQ79    [illegible] carcinoma         Lung 16         [illegible]   0.00
Lng 47XQ    [illegible] carcinoma         Lung 17         [illegible]   [illegible]
Lng SQ43    Squamous cell carcinoma       Lung 18         0.00          0.00
Bld 46XK                                  Bladder 1       [illegible]   [illegible]
Bld TR14                                  Bladder 2       0.00          0.54
Blad 66X                                  Bladder 3       0.000         0.00
Cln AS43                                  Colon 1         34.13         0.00
Cln AS45                                  Colon 2         0.00          0.00
Cln AS46                                  Colon 3         0.04          0.00
Cln AS67                                  Colon 4         [illegible]   0.01
Cln AS89                                  Colon 5         [illegible]   [illegible]
Cln DC63                                  Colon 6         [illegible]   0.00
Endo 68X                                  Endometrium 1   [illegible]   [illegible]
Endo 12XA                                 Endometrium 2   0.03          0.06
Kid 6XD                                   Kidney 1        0.01          0.03
Kid 710K                                  Kidney 2        0.00          0.00
Liv 15XA                                  Liver 1         0.06          0.01
Liv 201L                                  Liver 2         0.00          0.03
Mam 986                                   Mammary 1       0.00          0.00
Sto 288S                                  Stomach 1       0.00          0.00
Sto 531S                                  Stomach 2       0.00          0.02
Tst 39X                                   Testis 1        0.03          0.07
Tst 647T                                  Testis 2        0.02          0.09
Thr 590D                                  Thyroid 1       0.01          0.00
Utr 135XO                                 Uterus 2        0.00          1.25

0 = Negative

In the analysis of matching samples, higher expression
of Lng150 is detected in lung samples, showing a relatively
high degree of tissue specificity for lung tissue. These
results confirm the tissue specificity results obtained
with normal pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng150 in 18 lung cancer tissues compared
with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, is believed to make Lng150 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng150

Primers Used for QPCR Expression Analysis
Forward primer
ATGGGCAGGTCTTTCTTTCC (SEQ ID NO: 68)
Reverse primer
AGGCAGTTCTGTTACCCCACTA (SEQ ID NO: 69)
Probe
TGTGCTAAGGACAGGATTGGTTGGGTA (SEQ ID NO: 70)

Example 18
Sequence 18
Lng141

Gene ID 20152

ddx lung code SQlng003

Table 1. The absolute numbers are relative levels of
expression of Lng141 in 24 different normal tissues. All
the values are compared to normal brain (calibrator).
These RNA samples are commercially available pools,
originated by pooling samples of a particular tissue from
different individuals.





Tissue            NORMAL
Adrenal Gland     0.04
Bladder           0.00
Brain             1.00
Cervix            0.77
Colon             [illegible]
Endometrium       0.36
Esophagus         [illegible]
Heart             [illegible]
Kidney            [illegible]
Liver             0.00
Lung              [illegible]
Mammary Gland     [illegible]
Muscle            [illegible]
Ovary             [illegible]
Pancreas          [illegible]
Prostate          [illegible]
Rectum            0.65
Small Intestine   0.04
Spleen            0.70
Stomach           0.07
Testis            0.28
Thymus            0.91
Trachea           0.69
Uterus            1.27

0 = negative



The relative levels of expression in Table 1 show that
Lng141 mRNA expression is high in lung compared with most
other normal tissues analyzed.

The absolute numbers in Table 1 were obtained by
analyzing pools of samples of a particular tissue from
different individuals. They cannot be compared to the
absolute numbers originated from RNA obtained from tissue
samples of a single individual in Table 2.

Table 2. The absolute numbers are relative levels of
expression of Lng141 in 50 pairs of matching samples. All
the values are compared to normal brain (calibrator). A
matching pair is formed by mRNA from the cancer sample for
a particular tissue and mRNA from the normal adjacent
sample for that same tissue from the same individual.



Sample ID   Cancer Type                       Tissue          CANCER   MATCHING NORMAL ADJACENT
Lng 60L     Adenocarcinoma                    Lung 1          7.14     1.89
Lng 143L    Adenocarcinoma                    Lung 2          0.11     0.18
Lng 60XL    Adenocarcinoma                    Lung 3          0.19     0.39
Lng AC82    Adenocarcinoma                    Lung 4          0.29     0.00
Lng AC66    Adenocarcinoma                    Lung 5          5.06     1.62
Lng AC69    Adenocarcinoma                    Lung 6          27.00    4.17
Lng AC11    Adenocarcinoma                    Lung 7          1.52     3.43
Lng AC32    Adenocarcinoma                    Lung 8          2.14     2.94
Lng AC94    Adenocarcinoma                    Lung 9          2.45     4.35
Lng 223L    Adenocarcinoma                    Lung 10         0.00     1.21
Lng BR25    Bronchio-alveolar carcinoma       Lung 11         1.15     0.00
Lng BA641   Bronchogenic carcinoma            Lung 12         19.84    0.80
Lng SQ45    Squamous cell carcinoma           Lung 13         31.02    7.78
Lng SQ14    Squamous cell carcinoma           Lung 14         0.29     0.58
Lng SQ9X    Squamous cell carcinoma           Lung 15         0.54     0.50
Lng SQ56    Squamous cell carcinoma           Lung 16         1.08     0.52
Lng SQ80    Squamous cell carcinoma           Lung 17         0.55     0.85
Lng SQ32    Squamous cell carcinoma           Lung 18         0.68     0.91
Lng SQ16    Squamous cell carcinoma           Lung 19         0.90     0.79
Lng SQ79    Squamous cell carcinoma           Lung 20         8.11     6.87
Lng 90X     Squamous cell carcinoma           Lung 21         0.00     0.38
Lng 47XQ    Squamous cell carcinoma           Lung 22         0.24     0.28
Lng BR94    Squamous cell carcinoma           Lung 23         0.30     0.00
Lng SQ43    Squamous cell carcinoma           Lung 24         2.86     0.19
Lng LC71    Large cell carcinoma              Lung 25         0.62     2.30
Lng LC109   Large cell carcinoma              Lung 26         0.09     1.33
Lng MT67    Metastasis from renal carcinoma   Lung 27         0.63     0.67
Bld TR14                                      Bladder 2       0.00     0.00
Bld 46XK                                      Bladder 3       0.00     0.00
Cln AS89                                      Colon 1         2.40     11.55
Cln AS67                                      Colon 2         1.43     1.89
Cln AS45                                      Colon 3         0.00     0.27
Cln AS46                                      Colon 4         0.00     0.00
Cln AS43                                      Colon 5         6.25     0.00
End 28XA                                      Endometrium 1   1.25     1.59
Kid 10XD                                      Kidney 1        0.28     1.02
Kid 109XD                                     Kidney 2        0.23     0.64
Liv 15XA                                      Liver 1         1.22     0.84
Mam 173M                                      Mammary 1       0.19     0.47
Mam 220                                       Mammary 2       0.00     0.31
Mam 355                                       Mammary 3       0.36     0.09
Ovr A084                                      Ovary 1         3.22     0.94
Pro 101XB                                     Prostate 1      0.55     58.28
Pro 109XB                                     Prostate 2      0.11     0.21
Pro 125XB                                     Prostate 3      0.24     0.26
Sto 115S                                      Stomach 1       0.30     0.26
Sto 264S                                      Stomach 2       0.35     0.26
Sto 288S                                      Stomach 3       0.06     0.00
Tst 647T                                      Testis 1        6.87     3.96
Utr 135XO                                     Uterus 1        0.00     6.02

0 = Negative

In the analysis of matching samples, higher expression
of Lng141 is detected in lung samples, showing a relatively
high degree of tissue specificity for lung tissue. These
results confirm the tissue specificity results obtained
with normal pooled samples (Table 1).

Furthermore, we compared the level of mRNA expression
in cancer samples and the isogenic normal adjacent tissue
from the same individual. This comparison provides an
indication of specificity for the cancer stage (e.g., higher
levels of mRNA expression in the cancer sample compared to
the normal adjacent tissue). Table 2 shows differential
expression of Lng141 in 27 lung cancer tissues compared
with their respective normal adjacent tissue.

Altogether, the high level of tissue specificity, plus
the mRNA differential expression in the lung matching
samples tested, is believed to make Lng141 a good marker
for diagnosing, monitoring, staging, imaging and treating
lung cancer.

DNA sequence for Lng141

Sequence available from Incyte database.

Primers Used for QPCR Expression Analysis
Forward primer
ACTGCCCACCACGCTTTATA (SEQ ID NO: 71)
Reverse primer
TGAGGGTGGGGAGAGGTTAC (SEQ ID NO: 72)
Probe
AGTCACATTATTAGAGGTTCGCATCTCAGG (SEQ ID NO: 73)
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What is claimed is ; 

1. A LSG comprising: 

(a) a polynucleotide of SEQ ID NO:l, 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, or 74, or 

5 a variant thereof; 

(b) a polypeptide expressed by a polynucleotide of SEQ 
ID NO:l, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17, 18, 19, 20, or 74, or a variant thereof; or 

(c) a polynucleotide which is capable of hybridizing 
10 under stringent conditions to the antisense sequence of SEQ 

ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 
16, 17, 18, 19, 20 or 74. 

2 . The LSG of claim 1 wherein the polypeptide 
comprises SEQ ID NO: 75, 76, 77, 78, 79, 80, 81, 82, 83, or 
15 84. 

3 . A method for diagnosing the presence of lung 
cancer in a patient comprising: 

(a) determining levels of a LSG of claim 1 in cells, 
tissues or bodily fluids in a patient; and 
20 (b) comparing the determined levels of LSG with levels 

of LSG in cells, tissues or bodily fluids from a normal 
human control, wherein a change in determined levels of LSG 
in said patient versus normal human control is associated 
with the presence of lung cancer. 

4. A method of diagnosing metastases of lung cancer
in a patient comprising:

(a) identifying a patient having lung cancer that is
not known to have metastasized;

(b) determining levels of a LSG of claim 1 in a sample
of cells, tissues, or bodily fluid from said patient; and

(c) comparing the determined LSG levels with levels of
LSG in cells, tissue, or bodily fluid of a normal human
control, wherein an increase in determined LSG levels in
the patient versus the normal human control is associated
with a cancer which has metastasized.

5. A method of staging lung cancer in a patient
having lung cancer comprising:

(a) identifying a patient having lung cancer;

(b) determining levels of a LSG of claim 1 in a
sample of cells, tissue, or bodily fluid from said patient;
and

(c) comparing determined LSG levels with levels of LSG
in cells, tissues, or bodily fluid of a normal human
control, wherein an increase in determined LSG levels in
said patient versus the normal human control is associated
with a cancer which is progressing and a decrease in the
determined LSG levels is associated with a cancer which is
regressing or in remission.

6. A method of monitoring lung cancer in a patient
for the onset of metastasis comprising:

(a) identifying a patient having lung cancer that is
not known to have metastasized;

(b) periodically determining levels of a LSG of claim
1 in samples of cells, tissues, or bodily fluid from said
patient; and

(c) comparing the periodically determined LSG levels
with levels of LSG in cells, tissues, or bodily fluid of a
normal human control, wherein an increase in any one of the
periodically determined LSG levels in the patient versus
the normal human control is associated with a cancer which
has metastasized.

7. A method of monitoring a change in stage of lung
cancer in a patient comprising:

(a) identifying a patient having lung cancer;

(b) periodically determining levels of a LSG of claim
1 in cells, tissues, or bodily fluid from said patient; and

(c) comparing the periodically determined LSG levels
with levels of LSG in cells, tissues, or bodily fluid of a
normal human control, wherein an increase in any one of the
periodically determined LSG levels in the patient versus
the normal human control is associated with a cancer which
is progressing in stage and a decrease is associated with a
cancer which is regressing in stage or in remission.

8. A method of identifying potential therapeutic
agents for use in imaging and treating lung cancer
comprising screening compounds for an ability to bind to or
decrease expression of a LSG of claim 1 relative to the LSG
in the absence of the compound wherein the ability of the
compound to bind to the LSG or decrease expression of the
LSG is indicative of the compound being useful in imaging
and treating lung cancer.

9. An antibody which specifically binds a 
polypeptide encoded by a LSG of claim 1. 

10. The antibody of claim 9 wherein the polypeptide
comprises SEQ ID NO: 75, 76, 77, 78, 79, 80, 81, 82, 83 or
84.

11. A method of imaging lung cancer in a patient
comprising administering to the patient an antibody of
claim 9.

12. The method of claim 11 wherein said antibody is 
labeled with paramagnetic ions or a radioisotope. 
13. A method of treating lung cancer in a patient
comprising administering to the patient a compound which 
downregulates expression or activity of a LSG of claim 1. 



14. A method of inducing an immune response against a
target cell expressing a LSG of claim 1 comprising
delivering to a human patient an immunogenically
stimulatory amount of a LSG polypeptide so that an immune
response is mounted against the target cell.

15. The method of claim 14 wherein the LSG
polypeptide comprises SEQ ID NO: 75, 76, 77, 78, 79, 80, 81,
82, 83 or 84.



16. A vaccine for treating lung cancer comprising a 
LSG of claim 1. 
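
The comparison step recited in claims 3 through 7 (a patient LSG level measured against a normal human control) can be illustrated with a minimal Python sketch. The function names and the `tolerance` parameter are assumptions introduced here for illustration; the claims themselves state no numeric threshold, units, or normalization method, and this sketch is not a diagnostic procedure.

```python
# Illustrative sketch only. The claims recite "increase" and "decrease" versus a
# normal control without giving a threshold; `tolerance` is a hypothetical
# parameter added so that small measurement noise is not called a change.


def classify_change(patient_level: float, control_level: float,
                    tolerance: float = 0.0) -> str:
    """Classify the patient level relative to the control as
    'increase', 'decrease', or 'no change'."""
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    ratio = patient_level / control_level
    if ratio > 1 + tolerance:
        return "increase"
    if ratio < 1 - tolerance:
        return "decrease"
    return "no change"


# Associations as worded in claims 5 and 7 (staging / change in stage).
STAGING_ASSOCIATION = {
    "increase": "associated with a cancer which is progressing",
    "decrease": "associated with a cancer which is regressing or in remission",
    "no change": "no association recited in the claims",
}


def interpret_staging(patient_level: float, control_level: float,
                      tolerance: float = 0.0) -> str:
    """Return the claim 5/7 association for a single measurement."""
    return STAGING_ASSOCIATION[classify_change(patient_level, control_level, tolerance)]


if __name__ == "__main__":
    # Hypothetical relative expression values, for demonstration only.
    print(interpret_staging(2.0, 1.0))
    print(interpret_staging(0.5, 1.0))
```

For the periodic monitoring of claims 6 and 7, the same function could be applied to each time point in turn, since those claims refer to an increase in "any one of" the periodically determined levels.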
<110> Chen, Sei Yu

Macina, Roberto 
Sun, Yongming 
Recipon, Herve 
diaDexus, Inc. 



<120> Compositions and Methods Relating to Lung Specific 
Genes 



<130> DEX-0231 



<140> 
<141> 



<150> 60/228,378 
<151> 2000-08-28 



<160> 84 



<170> PatentIn Ver. 2.1



<210> 1 

<211> 1361 

<212> DNA 

<213> Homo sapiens 



<400> 1 

caacctgtct gtgtctgccc aggcctggag ttgtgtgacc ctccccaccg cctggccttc 60 
tccatggggg ctggcctttt ctcggtggtg ggcaccctgc tgctgcccgg cctggctgcg 120 
cttgtgcagg actggcgtct tctgcagggg ctgggtgccc tgatgagtgg actcttgctg 180 
ctcttttggg ggaggaggtg gagggagccg tgggcatcct caccaacgct gcaggttccc 240 
ggccctgttc cccgagtctc cctgctggct gctggccaca ggtcaggtag ctcgagccag 300 
gaagatcctg tggcgctttg cagaagccag tggcgtgggg ccccggggac agttccttgg 360 
aggagaactc cctggctaca gagctgacca tgctgtctgc acggagcccc cagccccggt 420 
accactcccc actggggctt ctgcgtaccc gagtcacctg gagaaacggg cttatcttgg 480 
gcttcagctc gctggttggt ggagagcatc agagctagct tccgccgcag cctggcacct 540 
caggtgccga ccttctacct gccctacttc ctggaggccg gcctggaggc ggcagccttg 600 
gtcttcctgc tcctgacggc agattgctgt ggacgccgcc ccgtgctgct gctgggcacc 660 
atggtcacag gcctggcatc cctgctgctc ctcgctgggg cccagtatct gccaggctgg 720 
actgtgctgt tcctctctgt cctggggctc ctggcctccc gggctgtgtc cgcactcagc 780 
agcctcttcg cggccgaggt cttccccacg gtgatcaggg gggccgggct gggcctggtg 840 
ctgggggccg ggttcctggg ccaggcagcc ggccccctgg acaccctgca cggccggcag 900 
ggcttcttcc tgcaacaagt cgtcttcgcc tcccttgctg tccttgccct gctgtgtgtc 960 
ctgctgctgc ctgagagccg aagccggggg ctgccccagt cactgcagga cgccgaccgc 1020 
ctgcgccgct ccccactcct gcggggccgc ccccgccagg accacctgcc tctgctgccg 1080 
ccctccaact cctactgggc cggccacacc cccgagcagc actagtcctg cctggtggcc 1140 



ctgggagcca ggatgggacc aaagtcaagg cctggggcat ggctgagtac cccagacgtc 1200
tggtccaggg cagacacatt cctctcagaa gcccgtgtct cagtgcaggt ggagccgtgg 1260
ggacagcgtg aaggtgtctc cagccaggcc ccaggcactg ggaggccctg ggtctccccc 1320
cagccacacc cagtaggtgt ggaggataaa ggcttctgtg g 1361



<210> 2 

<211> 1408 

<212> DNA 

<213> Homo sapiens 

<400> 2 

caaatattaa ctttgttttt ttaatagaag aatactctga attcctttca agcaatacaa 60
tctaaaatga gcaagttagc ctcaatagca ccccaaaata gaagttcttg gtatcttaac 120
agcttaacaa ggaggagctt gtgttcctac tgatgtcaaa agaaatgctt aaagatctca 180
gttggcttta tggtgattct agaactgggc aatacttgcc accttaaatt agaataaggt 240
ttccaatgag ttacagcaga ggagattggc ttcatagaca gaaaaaggtc tgaagaaagc 300
agaaacaaag aatttaaagt ggattggcta ttttaaagct ggttaaagtt gcaaaagaca 360
ggaataggga aacagaataa taaataactg gttggttaac atcaggttac tcttttgtaa 420
ggatgccaat tgaaactggc ctgtttggga gattagattt ttaggttatt aggttattat 480
ctctctctcc tgatttttcc aaaggccaga taagaatgta gtttctgttt gatgacttga 540
aactttatcg tgggtgattc cattttgatt tttagtctgt tctgtttggg cctagtgcag 600
gagcttagtg caaaacaaca gcctcctaaa atttaaaaga ctttaaagaa catacatgag 660
tttttcatca gataatattt atttgtattc attaatttat ttgattggtt aagtcttggc 720
tcccgagaat ctttgctcag aggaattttt caatccttgg ctattattct ccttatagtt 780
attgtattta cctccccggt gtattgaatt atcctatggg ttttaaatgc tttcctgcag 840
ccacctggac gtcaaatgat tgccatcaga aagagacaac ctgaagaaac caacaatgac 900
tatgaaacag ctgacggcgg ctacatgact ctgaacccca gggcacctac tgacgatgat 960
aaaaacatct acctgactct tcctcccaat gaccatgtca acagtaataa ctaaagagta 1020
acgttatgcc atgtggtcac actctcagct tgctgagtgg atgacaaaaa gaggggaatt 1080
gttaaaggaa aatttaaatg gagactggaa aaattcctga gcaaacaaaa ccacctggcc 1140
cttagaaata gctttaactt tgcttaaact acaaacacaa gcaaaacttc acggggtcat 1200
actacataca agcataagca aaacttaact tggatcattt ctggtaaatg cttatgttag 1260
aaataagaca accccagcca atcacaagca gcctactaac atataattag gtgactaggg 1320
actttctaag aagataccta cccccaaaaa acaattatgt aattgaaaac caaccgattg 1380
cctttatttt gcttccacat tttcccaa 1408



<210> 3 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<400> 3 

cccctcagga gcgcgttact tcacaccttc ggcagcagga gggcggcact tctcgcaggc 60
ggcagggcgg gcggccagga tcatgtccac caccacatgc caagtggtgg cgttcctcct 120
gtccatcctg gggctggccg gctgcatcgc ggccaccggg atggacatgt ggagcaccca 180
ggacctgtac gacaaccccg tcacctccgt gttccagtac gaagggctct ggaggagctg 240
cgtgaggcag agttcaggct tcaccgaatg caggccctat ttcaccatcc tgggacttcc 300
agccatgctg caggcagtgc gagccctgat gatcgtaggc atcgtcctgg gtgccattgg 360
cctcctggta tccatctttg ccctgaaatg catccgcatt ggcagcatgg aggactctgc 420
caaagccaac atgacactga cctccgggat catgttcatt gtctcaggtc tttgtgcaat 480
tgctggagtg tctgtgtttg ccaacatgct ggtgactaac ttctggatgt ccacagctaa 540
catgtacacc ggcatgggtg ggatggtgca gactgttcag accaggtaca catttggtgc 600
ggctctgttc gtgggctggg tcgctggagg cctcacacta attgggggtg tgatgatgtg 660
catcgcctgc cggggcctgg caccagaaga aaccaactac aaagccgttt cttatcatgc 720
ctcaggccac agtgttgcct acaagcctgg aggcttcaag gccagcactg gctttgggtc 780
caacaccaaa aacaagaaga tatacgatgg aggtgcccgc acagaggacg aggtacaatc 840
ttatccttcc aagcacgact atgtgtaatg ctctaagacc tctcagcacg ggcggaagaa 900
actcccggag agctcaccca aaaaacaagg agatcccatc tagatttctt cttgcttttg 960
actcacagct ggaagttaga aaagcctcga tttcatcttt ggagaggcca aatggtctta 1020
gcctcagtct ctgtctctaa atattccacc ataaaacagc tgagttattt atgaattaga 1080
ggctatagct cacattttca atcctctatt tcttttttta aatataactt tctactctga 1140
tgagagaatg tggttttaat ctctctctca cattttgatg atttagacag actccccctc 1200
ttcctcctag tcaataaacc cattgatgat ctatttccca gcttatcccc aagaaaactt 1260
ttgaaaggaa agagtagacc caaagatgtt attttctgct gtttgaattt tgtctcccca 1320
cccccaactt ggctagtaat aaacacttac tgaagaagaa gcaataagag aaagatattt 1380
gtaatctctc cagcccatga tctcggtttt cttacactgt gatcttaaaa gttaccaaac 1440
caaagtcatt ttcagtttga ggcaaccaaa cctttctact gctgttgaca tcttcttatt 1500
acagcaacac cattctagga gtttcctgag ctctccactg gagtcctctt tctgtcgcgg 1560
gtcagaaatt gtccctagat gaatgagaaa attatttttt ttaatttaag tcctaaatat 1620
agttaaaata aataatgttt tagtaaaatg atacactatc tctgtgaaat agcctcaccc 1680
ctacatgtgg atagaaggaa atgaaaaaat aattgctttg acattgtcta tatggtactt 1740
tgtaaagtca tgcttaagta caaattccat gaaaagctca ctgatcctaa ttctttccct 1800
ttgaggtctc tatggctctg attgtacatg atagtaagtg taagccatgt aaaaagtaaa 1860
taatgtctg 1869



<210> 4 

<211> 624 

<212> DNA 

<213> Homo sapiens 

<400> 4 

agcgcagtgg ccactatggg gtctgggctg ccccttgtcc tcctcttgac cctccttggc 60
agctcacatg gaacaggtga gggctagagg gcaggactcc tgggtccctg tggcaagaag 120
aggccagaga aaaggggtgg gacttcatgg tccctgagag tgacagagac accccagtcc 180
tgagcttcca agaggctctg gaggggcatt gctggggaag aggaactgtg ccggggagcg 240
tgagcaggaa ggttctgtgt ctccggagga atcagccctg actgctgggt cctaagctgt 300
acttctggat ccgcagggcc gggtatgact ttgcaactga agctgaagga gtcttttctg 360
acaaattcct cctatgagtc cagcttcctg gaattgcttg aaaagctctg cctcctcctc 420
catctccctt cagggaccag cgtcaccctc caccatgcaa gatctcaaca ccatgttgtc 480
tgcaacacat gacagccatt gaagcctgtg tccttcttgg cccgggcttt tgggccgggg 540
atgcaggagg caggccccga ccctgtcttt cagcaggccc ccaccctcct gagcggcaat 600
aaataaaatt cggtatgctg aatt 624
<210> 5
<211> 5746 
<212> DNA 

<213> Homo sapiens 



<400> 5 

cgcgacccag gcgcgggttc ccggaggaca 
ctgattggtt gtgggtggct acctcttcgt 
agcaagggtc tgaagcggaa acgggaggag 
tcctggtggc tagatcctgg ccacacagcg 
agctccctct ttgacctctc agtgctcaag 
gacctgcggc acctggtgct ggtcgtgaac 
cccgcggctg ccctgccacc tgtgcctagc 
ttactggcaa gctcggacgc tgccctttca 
agccacattg agggcctgag tcaggctccc 
cgtagcatcg ggggagcagc gcccagcctg 
ggctgtctac tggacgatgg gcttgagggc 
gacaatgaac tttgggcacc agcctctgag 
ggcaaggagg aagctccgga gctggacgag 
gctccgggca tgaccctcac agccacgggc 
cccctctact accacctaca aggttcaggc 
tgtgtatcag tcctggagcg ccggacccag 
gcagcaagaa cccccacgga agggcgtgag 
tgggcctcct gttaggagga agtgcctgca 
atgcagaact gaagctggtt ctgcagcaga 
aggtgcctcc cagcaacgcc atggaggcca 
cggagttggt ggaaattatc gtggagacgg 
tagcgggcgg cggcaaagag ggaatcttcg 
ccaggagcct cagcctgcag gaaggggacc 
acttcaagta cgaggacgca ctacgcctgc 
tctgcctgaa gcgcactgtg cccaccgggg 
gctacgagat caagggcccg cgggccaagg 
ctgtgaagaa gaagaagatg gtgcctgggg 
ttgacgtcga gttctccttt cccaagttct 
ctgtcaaggg tcctgtcccg gctgcccctg 
gtgtacgaga agtggccgaa gaggctcagg 
ccaggaaagc caaggtggag gctgaggtgg 
tggagctggt tgggccgcgg ctgccagggg 
ccaaggctgc cccctcagca gaggcagctg 
ggctcggagc cccggctccg cctgctgtgg 
aggtggagct gcctgccttg ccctcactgc 
cccgggaagg ggctgtgtcg gtagtggtgc 
gggtggacct ggccttgccg ggtgcagagg 
ccctgaagat gccccgcctt agttttcccc 
aggccaaggt agccaaggtc agccctgagg 
ccacctttgg gctttccctc ttggagcccc 
agctgaagct gcccaccatc aagatgccct 



gccaacaagc gatgctgccg ccgccgtttc 60 
tctgattggc cgctagtgag caagatgctg 120 
gaggaggaga aggaacctct ggcagtcgac 180 
gtggcacagg cacccccggc cgtggcctct 240 
ctccaccaca gcctgcagca gagtgagccg 300 
actctgcggc gcatccaggc gtccatggca 360 
ccacctgcag cccccagtgt ggctgacaac 420 
gcctccatgg ccagcctcct ggaggacctc 480 
caacccttgg cagacgaggg gccaccaggc 540 
ggtgccttgg acctgctggg cccagccact 600 
ctgtttgagg atattgacac ctctatgtat 660 
ggcctcaaac caggccctga ggatgggccg 720 
gccgaattgg actacctcat ggatgtgctg 780 
ctgggacaga gagctgatga cccaggagac 840 
ttctcgtgtc cccagctcag gactctgtgc 900 
gaggcccaag gagctggagg tgaccctcag 960 
cctggcagac agctgtgcgg cacctcgggc 1020 
cccaggcagc ggctcagagg cagctgctcc 1080 
aaggggagag gacacaggag cctggggtgc 1140 
ggagccggag tgccgaggag ctgaggcggg 1200 
aggcgcagac cggggtcagc ggcatcaacg 1260 
ttcgggagct gcgcgaggac tcacccgccg 1320 
agctgctgag tgcccgagtg ttcttcgaga 1380 
tgcaatgcgc cgagccttac aaagtctcct 1440 
acctggctct gcggcccggg accgtgtctg 1500 
tggccaagct gaacatccag agtctgtccc 1560 
ctctgggggt ccccgctgac ctggcccctg 1620 
cccggctgcg tcggggcctc aaagccgagg 1680 
cccgccggcg cctccagctg cctcggctgc 1740 
cagcccggct ggccgccgcc gctcctcccc 1800 
ctgcaggagc tcgtttcaca gcccctcagg 1860 
cggaggtggg tgtcccccag gtctcagccc 1920 
gtggctttgc cctccacctg ccaacccttg 1980 
aggccccagc cgtgggaatc caggtccccc 2040 
ccactctgcc cacacttccc tgcctagaga 2100 
ccaccctgga tgtggcagca ccgactgtgg 2160 
tggaggcccg gggagaggca cctgaggtgg 2220 
gatttggggc tcgagcaaag gaagttgctg 22 80 
ccagggtgaa aggtcccaga cttcgaatgc 2340 
ggcccgctgc tcctgaagtt gtagagagca 2400 
cccttggcat cggagtgtca gggcccgagg 2460 
tcaaggtgcc caagggacct
aagtgcccga ggcagccctt 
cagagatgaa actcccaaag 
tagagctgcc caaagtgtca 
aggtgcggct tccagaggta 
cagagatggc tgtgccggag 
tgaaactccc agaggtgtca 
tgccgaaagt gccagagatg 
aacttcctga gatgaaactc 
ccgatgtgca cctcccagaa 
tgaaactccc tgaggtgaaa 
tcccggaagt gcagctcccg 
ctgtgccaga ggttcgactc 
ccaaggtgcc tgaaatggcc 
tctgtgaaat gaaagtccct 
agatggctgt gcccgatgtg 
ggctgccgga aatgcaagtg 
tgaagctgcc cagggctccg 
ggatggaatt tggcttcaag 
ccccatcacg tggcaagcca 
ttccctgtct gcagccagag 
tgccttcagt ggagctagac 
ctaaaatggg caagggagag 
tgggcttccg agtgccctct 
aggaagggcg gctggagatg 
ctaagtttgg actctcgggg 
ctaccaagct gaaggtatcc 
aggctgaggc caaaggggct 
cacagctcag cctggatgcc 
tcaagttcaa ggggcccagg 
aggcagcaga actagtgcca 
ggagggtgaa gatgcccaag 
cagaagttca aggtgatcgt 
ttaagatccc cgaggtggag 
gggctgtggc cgtcagtgga 
tggtcactga gggccatgac 
aggtggagct gaccggcttt 
tcccttcagc agagggcaca 
tgcctggagc ccaggttgca 
ccaccgtgac agtgccccag 
gcgaggcggc cacaggcgag 
ctagggtggg gggcgagggt 
tctcactgcc cgacgtggag 
cagaggggga gggagaggcc 
tgcgggccaa ggagggggcc 
cccgagtggg cttcagccaa 
aggaggagga ggaggaggaa 
tccgggtccg cttgccacgt 



gaagtgaagc tccccaaggc 
ccagaggttc gactcccaga 
gtgccagaga tggctgtgcc 
gagatgaaac tcccaaaggt 
cagctgctga aagtgtcgga 
gtgcggcttc cagaggtaca 
gaggtggctg tgccagaggt 
aaagtccctg agatgaagct 
cctgaagtgc aactcccgaa 
gtgcagcttc caaaagtccc 
ctcccgaagg tgcccgagat 
aaagtcccag agatgaaact 
cccgaggtgc agctgccaaa 
gtgcccgatg tgcacctccc 
gacatgaagc tcccagagat 
cacctccccg aggtgcagct 
ccgaaggttc ccgacgtgca 
gaggtgcagc taaaggccac 
atgcccaaga tgaccatgcc 
ggcgaggcgg gtgctgaggt 
gtggatggtg aggctcatgt 
ctgccaggag cacttggcct 
cgggcggagg gccccgaggt 
gttgaaattg tcaccccaca 
atagagacaa aagtcaagcc 
ccaaaggtgg ctaaggcaga 
aaatttgcca tctcactccc 
ggggaggcag gcctgctgcc 
cacctgccct caggcaaggt 
tttgctctcc ccaagtttgg 
ggggtggctg agttggaggg 
ctgaagatgc cttcctttgg 
gccagcccgg gggaaaaggc 
ctggtcacgc tgggcgccca 
atgcagctgt caggcctgaa 
gcggggctga ggatgcctcc 
ggggaggcag gtaccccagg 
gcaggctaca gggttcaggt 
ggtggtgagc tgctggtggg 
cttgagctgg acgtggggct 
ggtgggctga ggctgaagtt 
gctgaggagc agcccccagg 
ctctcgccat ccgggggcaa 
ggacacaagc tcaaggtacg 
gaggagggtg agaaggccaa 
agtgagatgg tcactgggga 
gagggcagtg gggaaggggc 
gtaggcctgg cggccccttc 



tcctgaggtc aagcttccaa 2520 
ggtggagctc cccaaggtgt 2580 
ggaggtgcgg cttccagagg 2640 
gccagagatg gctgtgccgg 2700 
gatgaaactc ccaaaggtgc 2760 
gctgccgaaa gtgtcagaga 2820 
gcggcttcca gaggtgcagc 2880 
tccaaaggtg cctgagatga 2940 
ggtgcccgag atggccgtgc 3000 
agagatgaag ctccctgaga 3060 
ggctgtgccc gatgtgcacc 3120 
ccctaaaatg cctgagatgg 3180 
agtctcagag atgaaactcc 3240 
agaggtgcag ctgcccaaag 3300 
aaaactcccc aaggtgcctg 3360 
gccgaaagtg tcagagattc 3420 
tcttccgaag gcaccagagg 3480 
caaggcagaa caggcagaag 3540 
caagctaggg agggcagagt 3600 
ctcagggaag ctggtaacac 3660 
gggtgtcccc tctctcactc 3720 
gcaggggcag gtcccagccg 3780 
ggcagcaggg gtcagggaag 384 0 
gctgcccgcc gtggaaattg 3900 
ctcttccaag ttctccttac 3960 
ggctgagggg gctgggcgag 4020 
caaggctcgg gtgggggctg 4080 
tgccctcgat ctgtccatcc 4140 

agaggtggca ggggccgacc 4200 

ggtcagaggc cgggacactg 4260 
caagggctgg ggctgggatg 4320 
gctggctcga gggaaggaag 43 80 
tgagtccacc gctgtgcagc 4440 
ggaggaaggg agggcagagg 4500 
ggtgtccaca gccaggcagg 4560 
gctgggcatc tccctgccac 4620 
gcagcaggct cagagtacag 4680 
gccccaggtg accctgtctc 4740 
tgagggtgtc tttaagatgc 4800 
aagccgagag gcacaggcgg 4860 
gcccacactg ggggccagag 4920 
ggccgagcgt accttctgcc 4980 
ccatgccgag taccaggtgg 5040 
gctgccccgg tttggcctgg 5100 
gagccccaaa ctcaggctgc 5160 
agggtccccc agccccgagg 5220 
ctcgggtcgc cggggccggg 5280 
taaagcctct cgggggcagg 5340 
agggcgatgc agcccccaag tcccccgtca
gggtgtccct aagccccaag gcccggagtg 
gggtgcggct gcccagcgtg gggttttcag 
agggggctca ggctgcggct gtctgaagcc 
cctttctcta ccccctcgct gttgtgtgtg 
ggaggtgggt gactgaccag ggctggcagg 
gcctgtaccc caccaagcca tgtgaataaa 



gagagaagtc acccaagttc cgcttcccca 5400 
ggagtgggga ccaggaagag ggtggattgc 5460 
agacaggggc tccaggcccg gccaggatgg 5520 
cctagtcaga tggggatccc ttcttgcctt 5580 
tgataactag cactaaccct aagagggccg 5640 
gaggcctgct cctgtctctc tggcaggagt 5700 
ataatctgga agcaaa 5746 



<210> 6 

<211> 1639 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> unsure 
<222> (1447) 

<220> 

<221> unsure 
<222> (1554) 



<220> 

<221> unsure 
<222> (1574) 



<220> 

<221> unsure 
<222> (1592) 



<220> 

<221> unsure 
<222> (1595) 



<220> 

<221> unsure 
<222> (1610) 



<220> 

<221> unsure 
<222> (1612) 



<400> 6 

ctagagcctg gggtctcggc aacttccggc 
actgcgcgtg cgcttcggcc cggctcctcc 
gtacggccgg gcggttggcg tcctctgcgg 
ccctttgaag gagtggcgac ggcccggaca 
gaggggtcct gcgctccgcc tggcgggggc 



ggcgggagct gcagagcgca aggcccgccc 60 
tgcgcccccg gcccctgcga ctgggacttg 120 
ctcctgccag gggcgggctt ttcaaatctt 180 
gttcgcgttg gagatggagg ggccgagcct 240 
ttcccaccca gcaggactgc aacattcaag 300 
aaaaaataga cttagaaatt cgaatgcgag
ctcagaaaga tcaagtttta catgcagtta 
tggcctatac atcggagcta cagaaattag 
gtgatgtgaa atttgaaagt aaagaacgaa 
atattcgaat accactaatg tggaaagact 
gacgctatgc cattttttgt ttattcaaaa 
tgaatgtgga taaaacaatc acagatatat 
tttttaatct tcagagaata aaaataattt 
attattggtt ctttggattc attttatgtt 
tatgatttta aaaattattt tgttcagaag 
tgtgaaataa aaatggaaat cttgtaatta 
tatgacccat ttttaaattg ttaataaata 
aacagaatat cctgtaatgt tatttgatat 
tatctcatgt aggatatttg gttgcagaaa 
agatagtcct gaagtacatg ctatatagga 
ttcttatggt gcacttcttt catgtacttc 
ggctgttttt attgtccctg ctcttttaca 
caagaaggaa ctccttgggt agccatagaa 
cctccttcaa aggttctatg tgcctaaatc 
gacacgntct ttagatctaa atgttaatag 
catggttttt gccattttca gctatggagc 
tgccactcac atancattaa aaaaacctat 
gtgtctcata gtaatagta 



aaggaatatg gaaactcctt tctctgagca 360 
agaatctcat ggtgtgcaat gctcgactaa 420 
aagaacagat tgcaaatcag actggaagat 480 
cagcatgtaa aggaaagatt gccatatcag 540 
ctgatcactt cagcaataaa gaacgatcac 600 
tgggagctaa tgtgtttgat actgatgtgg 660 
gttttgaaaa tgtaaccata ttgtaagtat 720 
aaaattcttc ttttttaaaa gaaagttctt 780 
taaatgttta agtgatcttt aaatgtttaa 840
aagtccattt ctctatctgc agttttctga 900 
ctattagcag taaatatttg acttattaga 960 
tagttcagtt attaacaaag ctatgcatac 1020 
agagagaatt taagcataaa acaggatttt 1080 
tactaaaata gtatagcgac tttatttaca 1140 
agagcacttt gaaattttgg ggtgttcttt 1200 
aaagcaataa aaaaaaatgg gtgatctcag 1260 
ggctcatttt attgtggtca taatacagaa 1320 
atcattttta acttacatag tttttcctgc 1380 
agtgtgggat ttgtatttta gacttttaaa 1440 
ctactaacta ttaatataaa aatccatgtg 1500 
tagacaggtg agattttaga ggnctagttt 1560 
ancanaccat attttgtagn tnctggtcca 1620 

1639 



<210> 7 

<211> 865 

<212> DNA 

<213> Homo sapiens 



<400> 7 

gtgggtaatt tccactcttt gtctgtagtt tctgaatttc ttacaagaaa catgtataag 60
aaatatgaca aaagttattt tataaataac agggacactt ccaggcattt cagtctttaa 120
gaaaagctaa ggcttgtttg gctttttgtt tatttttagg tttttggtgt cctcatgacc 180
taacctcatc ccagtgagta gagactggga ggggagagca gcagctggat gggcaggctg 240
ggagcgcttg tgacggagag gagctatgga cgtctgcttc tctgccaagg gagagagtga 300
ggcaggcctg ggcccgctga cttcagggtg aggccacagc tactgcagcg ctttttatct 360
atctatttat ttactgagat ggagtcttga tccacattag tcaatttggc atagctagtg 420
cacagtctga aagctggtga gatagatgta gagttgccaa attttcaatt tatctattag 480
gcagcaggag gctacagtgg gccctatgca aaacaattca tgtagcatta tgggaatttg 540
tcctttgcac ttcctggcta tcttgctttt atgtgcattt attactaaga agttgtactc 600
atggagtatt gtattatcat tgttgataaa ataataatga tattttgcag tcaccatgca 660
tctttctttg ttccctgact ttgtttgcac aggaaaatta aagaaacaaa ttgccgttta 720
gtacttttcc acctctgcag taaaaaatcg tcaggaaagc acaagctcag aattatcaat 780
gagcagatgc taacaggtta tgaaactatg caaatcaaag tacacttgaa caaatgaact 840
gaagttgctg ccttgtcaac ttaga 865
<210> 8

<211> 2929 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cgggagctgt ggaccttcgc gggttcccgg gacccgagcg caccgcggct agcctacggc 60 
tacggcccgg gcagcctgcg cgagctgcgg gcgcgcgagt tcagccgcct ggcaggaact 120 
gtctatcttg accatgcagg tgccaccttg ttctcccaga gccagctcga aagcttcact 180 
agtgatctca tggaaaacac ttatggtaat cctcacagcc agaacatcag cagcaagctc 240 
acccatgaca ctgtggagca ggtgcgctac agaatcctgg cgcacttcca caccaccgca 300 
gaagactaca ctgtgatctt cactgccggg agcacggctg ctctcaaact ggtggcagag 360 
gcctttccat gggtgtccca gggcccagag agcagtggga gtcgcttctg ttacctcacc 420 
gacagccaca cctccgtagt gggtatgagg aacgtgacca tggctataaa tgtcatatcc 480
atcccggtca ggccagagga cctgtggtct gcagaggaac gtggtgcttc agccagcaac 540 
ccagactgcc agctgccgca tctcttctgc tacccagctc agagtaactt ttctggagtc 600 
agataccccc tgtcctggat agaagaggtc aagtctgggc ggttgcgccc ctgtgagcac 660 
gcctgggaag tggtttgtgc tgctggatgc aggcctccta cgtgagcacc tcgcctttgg 720 
acctgtcagc tcaccaggcc gactttgtcc ccatctcctt ctataagatc ttcgggtttc 780 
gtacaggcct gggggcatct gtgggtccat aatcgtgcgg ctcctctact gaggaagacc 840 
tactttggag gagggacagc ctctgcgtac ctagcaggag aagacttcta catcccgagg 900 
cagtcggtag ctcagaggtt tgaagatggc accatctcat tccttgatgt tatcgcgcta 960 
aaacatggat ttgacaccct agagcgcctc acaggtggaa tggagaatat aaagcagcac 1020 
accttcacct tggctcaata tacctacatg gccctgtcct ctctccagta ccccaatgga 1080 
gcccctgtgg tgcggattta cagcgattct gagttcagca gccctgaggt tcagggcccg 1140 
atcatcaatt ttaatgtgct ggatgacaaa gggaacatca ttggttactc ccaggtggac 1200 
aaaatggcca gtctttacaa catccacctg cgaactggct gcttctgtaa cactggggcc 1260 
tgccagaggc acctgggcat aagcaacgag atggtcagga agcattttca ggctggtcat 1320 
gtctgtgggg acaatatgga cctcatagat gggcagccca caggatctgt gaggatttca 1380 
tttggataca tgtcgacgct ggatgatgtc caggcctttc ttaggttcat catagacact 1440
cgcctgcact catcagggga ctggcctgtc cctcaggccc atgctgacac cggggagact 1500
ggagccccat cagcagacag ccaggctgat gttatacctg ctgtcatggg cagacgtagc 1560
ctctcgcctc aggaagatgc cctcacaggc tccagggttt ggaacaactc gtctactgtg 1620
aatgctgtgc ctgtggcccc acctgtgtgt gatgtcgcca gaacccagcc gactccttca 1680
gagaaagctg caggagtcct ggagggggcc cttgggccac atgttgtcac taacctttat 1740
ctctatccaa tcaaatcctg tgctgcattt gaggtgacca ggtggcctgt aggaaaccaa 1800
gggctgctat atgaccggag ctggatggtt gtgaatcaca atggtgtttg cctgagtcag 1860
aagcaggaac cccggctctg cctgatccag cccttcatcg acttgcggca aaggatcatg 1920
gtcatcaaag ccaaagggat ggagcctata gaggtgcctc ttgaggaaaa tagtgaacgg 1980
actcagattc gccaaagcag ggtctgtgct gacagagtaa gtacttatga ttgtggagaa 2040
aaaatttcaa gctggttgtc aacatttttt ggccgtcctt gtcatttgat caaacaaagt 2100
tcaaactctc aaaggaatgc aaagaagaaa catggaaaag atcaacttcc tggtacaatg 2160
gccacccttt ctctggtgaa tgaggcacag tatctgctga tcaacacatc cagtattttg 2220
gaacttcacc ggcaactaaa caccagtgat gagaatggaa aggaggaatt attctcactg 2280
aaggatctca gcttgcgttt tcgtgccaat attattatca atggaaaaag ggcttttgaa 2340
gaagagaaat gggatgagat ttcaattggc tctttgcgtt tccaggtttt ggggccttgt 2400
cacagatgcc agatgatttg catcgaccag caaactgggc aacgaaacca gcatgttttc 2460
caaaaacttt ctgagagtcg tgaaacaaag gtgaactttg gcatgtacct gatgcatgca 2520
tcattggatt tatcctcccc atgtttcctg

aaagagaatg tggaaggtca tgatttacct 

taaaaaaaat ttttagcata cattaaagtt 

agatctgcaa cttggttcag tagaacttga 

ag a 9gcaggg aatgctctca cctgcttcct 

cactggctgt gctcaggaga gcacttctga 

cgtgaggctc ctgtagtatt tgaagtataa 



tctgtaggat ctcaggtgct ccctgtgttg 2580 
gcatctgaga aacaccagga tgttacctcc 2640 
tctcttttac agtgatctct attattgtta 2700 
tgttttgaat aaggagagct ctttttcttt 2760 
tctgcctttg acttctcacc ctgcaatttg 2820 
ggcctcagga acgaatgctg cacccacatc 2880 
gcgttgaggg ggtccttgc 2929 



<210> 9 
<211> 1205 
<212> DNA 

<213> Homo sapiens 
<400> 9 

ggtgctgttc tgatcggtgt ggtttgtgtt ctctggtcgt ggttttccgt tgttgttgtt 60 
ttgtgcttgt ttgtggttgc gtgtgtgtgt ctgtcttgtc tgtcatgtgt ctctttcttc 120 
gtttctgtgt gttttgctct ttcgtgtggt cgttgtttgt tgcgcccgca gttcgtctct 180 
ccattttgct tagatatgca cctcagtggg tgtgtttata tacacactgt gtgctcgtat 240 
attgttcgtg gaggatctgg tatgatattt catcgcgcgg ctcgctccgc gtatctatgg 300 
ggtcgtgatg tgttcccgcg cgcagagaga tgttcttgag cccacacgtc ctctgggtga 360 
cccccaagtg attaaccgtt tgtgtgcgtc tctcatggtg attctcatct ggttgtattg 420 
gcgccccaca ttgtggccca cacttttgtg catctttgct ctctcttgct ggtgttgtgt 480 
ctctcgcgca ctctctgctg tgcttatgat agtagagatt tgcttctcct ctgtcgtggg 540 
tgttgttttt tttctttttt ttgtgtgtgg ttttttgttt acgcgagatt ggtcgtttca 600 
cggtgagggt ccctgttcac aatgcactgt taagtcccag tccacgttgg aagtggtcca 660 
attcgtttct gtttcttttc tttctttctt tttttttttt tttgagatag agtctcactc 720
tgtcacccag gctggagtgt agtggcacta tctaggctca ctgcaacccc ccacctccca 780 
ggttcaagca attatgctgc ctcagcctcc caagaagctg ggacttcagg catgagccac 840 
cacacctgga taattttttg tattttttta gtagagacgg ggtttcacca tgttggccag 900 
gctggtctag aactcctgag ctcaagtgat tctccgccca ccttggcctc ccaaagtgct 960 
gggattacag gcatgagcca ccacgcccag cctgcaacgc tttctttttg ccctcttgtt 1020 
tatcagtttg tgtcatattt acacagcaaa gcctagtggc taaaagcacg agccacggag 1080 
caggctgcct aggttcttat ctgagctctg ccactagctg gcttaaagca gagctgcggc 1140 
ctctattttt tcattggtaa attaaggcca atgatcatat atacctcaca cgatggctgt 1200 
gagaa 1205 



<210> 10 

<211> 3327 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (491) . . (509) 

<400> 10 
atttttatac cataacttga gtgtattgcc
agtttatatc ccagaaacat tgagccatca 
acctggctag gtagggaggg ggtggttatc 
cctctgtgca gataatacct ttttcttgct 
tctttcttgc aagtgcatct ttttccttcc 
cctagattgc agtgtgtcct gtggacaggc 
gtttacaaaa atgaattttt cctggtttcc 
nnnnnnnnrm nnnnnnnnng gtctggagac 
aaggtggtgg ttctagcttg gtctctgttg 
ttccttgact gttcattttt ttcctgcccc 
tgtggaattt cctctttgag gagcctgggc 
atgattactt tagacctagc ccaggcttgg 
ctggcctgta gagttagaac taccatttcc 
ttgctatgca aaacaatcta tcccaggttc 
cacaaaacgt agcacaaaca ttcattatgg 
cctttacttt tttcctgctg gctacagcat 
agaacagaat ggagggctct gggaggaggc 
aagtgcctaa agagagtgat gctaacactc 
gtctacttcg tgttctgtca ccctttgggg 
catgttctcc acttggatac attttggggc 
agtcattcac cagcatttgc aaatgtccat 
aacaagtttg tgttctctcc ttttctctct 
tggggcttga aatgcatttt tagccctttg 
gcaagagaat cagctttggg caatgacaag 
aaagataatt tttgagactt gaaaaatacc 
tggtgcatgc agatggagaa gtggtgttgg 
taagtggtgc ttcttaccca agcttcaagg 
cttagttggg tctcttgttc cctttgtacc 
tttgcctggc ttcctttccc ttttcttcta 
aagcttatac tagagaagaa cgcagttgcc 
ttctggcatt ttccacacct gtccactcct 
aatctggttc tttttctctt tacctggggc 
gctatttggg tatcctgggt ttgagtgtta 
agaagagaat ggagagaatt tgaataaaag 
tttatccagt ccaacctgat ccattaggga 
gcctggggct actgttgctg ggaacttagg 
aatttgaaac ttccctaaaa agctcctaat 
tcctatttag ctaagcagca gtgtttttgg 
cagcactcaa gatgggcagc caagggtgca 
taaggctggt gggacagttt tggacctgga 
cttccccatc agggtagaaa aatcatctca 
attgggggac gttattttta tttatatatg 
aataccttcc ttcttaaaat ctgatcatgg 
ggccttctaa gcagattggg aaggaggtat 
aggacttgcc ttctccctgg gcagggagag 
tcatactgac ttagagcctc tggctgctgt 
tgagctaaaa acaaaacaga atgaggtggg 
ataggaaacc ctccaagaat tgtgcaagta 



aaaatttgga aatccttccc atgcctgatg 60 
gaatgaactg tgtacctgat ttgttctctg 120 
gccccaagat ggggtccagg ctccatcctt 180 
atagcctccc tcctctgcac tgtcctgcac 240 
cctggactgt cctctgaccc tttggctcat 300 
tggggaattt tgctgctccc tattgcttct 360 
cactagggca tgtgggtggg tggcatggac 420 
atggggtttg gctgtcttgc aggactggag 480 
gccttgaagc aagcatcccc cctgcccttt 540 
actgcttggg atggggagtt gcaacttcag 600 
ttggatctat cctgatctgg tgatgaagcc 660 
aggccagctg gaggaagaag ggtctaaatc 720 
tccccttagc tgcccttgta tgacccggat 780 
tgttctggtt ggctacattg ttcagcaact 840 
agaaagcatc aggactgtfcg agtaactcct 900 
ggggtgccct ataggcacaa gcccagctga 960 
agctcactgg agagcctaca ttccttacac 1020 
catctgccct gtccattgcc ttcatataca 1080 
aggggagttc tcctgggaca gtgggctctg 114 0 
taggatcagg gcactattcc tggagggtcc 12 00 
agggagcagg tggcagcctc tactcccagc 1260 
ttgcctcact ctctccagtt ggttttcagc 1320 
acgtggctta tgccattcaa gaaataaaaa 1380 
aaatgagttc ttactctgat ttttttgtaa 1440 
ccgaccttga gattattcct gtttgaaagg 1500 
cagcaagctt tggctcatgt ggatttggtt 1560 
aagtgcttgg gggaccccca gcctcatcct 1620 
actgttttgc cttccttttc ctcttctctc 1680 
ttcactctgc ttgcttgctg gactgccctc 1740 
cttgcccacc ttgtgtgaag tcaggagggt 1800 
tggagctggt ttctctcatt gctttttcta 1860 
ctggcttttc tgagattgtc ttagggttga 1920 
ggggatggac ataaaggaaa aagagtgatg 1980 
gtgggaaagg agagcactgt tctttgattg 2040 
tcgaggtgct acactggcct ccagggataa 2100 
cttaacataa agccgaagaa ggtacctaga 2160 
gcccacctgc tagatagctt ctctgtggcc 2220 
atactttttt tttctgtttg tgaataaggc 22 80 
ctgactatta gctggcccat aggatatctg 2340 
atcatgtgta actaacaagg ttggacgttt 24 00 
aactagccaa aaggcagttt tggaaactac 2460 
gggcctaggc caatccagga tggtagctgg 2520 
cagggatatg cagggcactt tttactattt 2580 
tttctggttt tcgctttcct ccgacttaat 2640 
aggctgggtt ggtgctctcc cttactctac 2700 
ttgggcatcc aagaaaggga ggggaaggaa 2760 
aaagggagat tttcttcttt acagaggaaa 2820 
aagacatttg ttgaatgcac tgagtccctt 2880 
ggtgtagtag caataaggaa aaatgaaatt
ggtatgtgat gttgcactta gcagccatgt 
ctttagtttc taaacttttt atccctctca 
atccccacag ctgtgtactt gtttgcattt 
gctttgagct gtaccttgtc cagtccattg 
ttccttctga agcaagcaac atcagcagca 
taaagacaac agtggcttct atttctaaaa 
aaaaaaaaaa aaaaaaaaaa aaggcgg 



actttcctgt gcacacagtc cagcctaatt 2940 
ggtgggcatg tgtgactact ctggttttca 3000 
agtccagcat ggatggggaa atgtctctgg 3060 
gtttcccttt gagatttgtg tttgtgtcct 3120 
tgaaattatc ccagcagctg taatgtacag 3180 
gcagcagcag cagcacaatt ctgtgtttta 3240 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3300 

3327 



<210> 11 
<211> 697 
<212> DNA 

<213> Homo sapiens 
<400> 11 

ggccctagtc caatataact gttgtcctta 

acacaggggg aatgtcatgt gaagattgga 

gaaggtagaa gagaggccta gaacagatcg 

accctcccaa cacattgatc ttggacttcc 

ttgttgttta taagcccccc agtttgtgga 

atataatgta caatcctttg tatatattac 

gtttttatag ctgcatgcat aacagatatt 

tttgtccttg cccaaatctc atgttgaaac 

tggttggagg tgtctggatc ttgggggagg 

tagcgagttc tcgggaggtc tggtcattta 

gttcttgcca tgtgagatgc ctgctcccac 

caggcatccc cagaagctga gcagatgcca 



taaaaagggg aaatttggat atagacacat 60 
gttatgctgc cacaaaccga gaaactacca 120 
ttccctagtg ccctcaaaag gaaccaacca 180 
cagcttccag aacagtgaga caataaattt 240 
act tea tt at ggcagccctg gcaaacttat 300 
tggatttgat ttgetagtat tttgetgagg 360 
ggtctatact tctctgatat agtctggata 420 
aaaataaccc cgcatattgg agatggggee 480 
atccttcatg gcttggtgtt gtcattgega 540 
aaagtgtgtg gcatctcccg cctctctccg 600 
ttcctcttct gecatgagta aaagctccct 660 
gtgecat 697 



<210> 12 
<211> 1221 
<212> DNA 

<213> Homo sapiens 
<400> 12 

ttaaataact tggaaaaact actggactgg atggtctctt aagattgcaa gtgccaagcc 60 
ttggcactct aggacccatc ccttttaaca gtagcatcta tttttagact agacaaggat 120 
acctaagctg tattacaaat atagttccag acacatcatg gtactctaca tctagtccgg 180 
ttcctgatac ctttaaccaa tgctctggcc tggccattcc tgcagtgcct gtccttctcc 240 
tttggcttat gctatcctca gcagcacatg cattatttgt aagcctccct tacccatttg 300 
atactaccca atcaagactc tagtataaat gttctgtccc ttcaagaaac cttccccagt 360 
tggccatttt cccaaagtac attttccact cttatggttc agtacacttt gctttgtctg 420 
gtagttttat gtgtaaagct cagaggactg gafccttgggt ttctttatag taaaccatcc 480 
aaatgccttg cattgtacta tactgaaggt aacatggatc caagtcatat ggcttaaaaa 540 
ttcttttctc tttcaggatc gaatcatcaa tttagttgtt ggcagcttaa catccttatt 600 
gattctagta acgctgataa gtgcttttgt tttccctcaa ctacctccaa aaccgttgaa 660 
tatattcttt gctgtctgca tctctttgag tagtattact gcctgcatac ttatctactg 720 
gtatcgacaa ggagacttag aaccgaaatt tagaaagcta atttactata tcatattttc 780 
tatcatcatg ttgtgtatat gtgcaaacct gtacttccat gatgtgggaa ggtgaggctg 840 
ccaaggagaa gtacttacca ggactcttca aaatgataca ttaggacagt gagtaatttt 900 
tggataaggt atgctgaaga atctcctgca gaagtctgat acatgatttt catgttaatt 960 
gtaaatgtta attccctctt gcaagggaga catatcctag atcactttgc tttttcttta 1020 
aggagctgat gttgcaccta aacattccaa cccttaaagc taaaacagca caaaaaaatt 1080 
tcacttttga aatgaaattt ttataattgt atggcaaaag gctatgtaaa aacaaatctt 1140 
gcatcttaag acaaatattc ttttatttct gttaaactga atatacaatt gttccctagg 1200 
caaccaactt ttgcttataa c 1221 



<210> 13 
<211> 2238 
<212> DNA 

<213> Homo sapiens 
<400> 13 

ggtattcagc ggcgacagcg gcgactgcgg cggccgcggg agggcatccc gttggggatc 60 
cttccgcaca ctgaagagta cgtcttcggg tctaccccta atcacataat ggctgtgttt 120 
aatcagaagt ctgtctcgga tatgattaaa gagtttcgaa aaaattggcg tgctctttgt 180 
aactctgaga gaactactct atgtggtgca gactccatgc tcttggcatt gcagctttct 240 
atggcagaga acaacaaaca gcacagtgga gaatttacag tctctctcag tgatgtttta 300 
ttgacatgga aatacttgct ccatgagaaa ttgaacttac cagttgaaaa catggacgtg 360 
actgaccatt atgaggacgt taggaagatt tatgatgatt tcttgaagaa cagtaatatg 420 
ttagatctga ttgatgttta tcaaaaatgt agggctttga cttctaattg tgaaaattat 480 
aacacagtat ctcctagtca actactggat tttctgtctg gcaaacagta tgcagtaggt 540 
gatgaaactg atctttctat accaacatca ccaacaagta aatacaaccg tgataatgaa 600 
aaggtgcagc tgctagcaag gaaaattatc ttttcatatt taaatctgct agtgaattca 660 
aagaatgacc tggctgtggc ttatattctc aatattcctg atagaggact aggaagagaa 720 
gccttcactg atttgaaaca tgctgctcga gagaaacaaa tgtctatctt tttggtggcc 780 
acgtctttta ttagaacaat agagcttgga gggaaaggat atgcaccacc accatcagat 840 
cctttaagga cacatgtaaa gggattgtct aattttatta atttcattga caaattagat 900 
gagattcttg gagaaatacc aaacccaagc attgcagggg gtcaaatact gtcagtgata 960 
aagatgcaac tgattaaagg ccaaaacagc agggatcctt tttgcaaagc aatagaggaa 1020 
gttgctcagg atttggattt gaggattaaa aatattatca attctcaaga aggtgttgta 1080 
gctcttagca ccactgacat cagtcctgct cggccaaaat ctcatgccat aaaccatggt 1140 
actgcatact gtggcagaga tactgtgaaa gccttattag ttcttttgga cgaagaagca 1200 
gctaatgctc ctaccaaaaa caaagcagag cttttatatg atgaggaaaa cacaatccat 1260 
catcatggaa cgtctattct tacacttttt aggtctccca cacaggtgaa taattcgata 1320 
aaacccctaa gagaacgcat ctgtgtgtca atgcaagaga aaaaaattaa gatgaagcaa 1380 
actttaatta gatcccaatt tgcttgtact tataaagatg actacatgat aagcaaggat 1440 
aattggaata atgttaattt agcatcaaag cctttgtgtg ttctttacat ggaaaatgac 1500 
ctttctgagg gtgtaaatcc atctgttgga agatcaacaa ttggaacgag ttttggaaat 1560 
gttcatctgg acagaagtaa aaatgaaaaa gtatcaagaa aatcaaccag tcagacagga 1620 
aataaaagct caaaaaggaa acaggtggat ttggatggtg aaaatattct ctgtgataat 1680 
agaaatgaac cacctcaaca taaaaatgct aaaataccta agaaatcaaa tgattcacag 1740 
aatagattgt acggcaaact agctaaagta gcaaaaagta ataaatgtac tgccaaggac 1800 
aagttgattt ctggccaggc aaagttaact cagtttttta gactataaat ttgtgtctta 1860 
tatgctttag gtttatgtat ctataaacca ttcaccaaag acatgcttaa tttttaagag 1920 
atcaaggtgt aaattatgat gatttattat tttggtctac agtgtatgta aggttagtat 1980 
gttaagcatt gtttttgact ttttaaaaat accttagatg caaatttata ggagaaaaaa 2040 
cactttcaga taagaggtgt ttgctgggat ggaagaacta cctggcatgt aagaaatatc 2100 
gtcagtcgtc ctaatgcata ttgtgactgt ttgcatatac ttctgtttat aaaagtatca 2160 
gttttacttt tcagaggatt tgtaagaatc atttaaattt tcattgaaat aaacgacaag 2220 
tcacattgaa aaaaaaaa 2238 



<210> 14 

<211> 1769 

<212> DNA 

<213> Homo sapiens 



<400> 14 

tttttttttt ttgtattttt tgtagaggta gggtctcact ttgttgccca ggctggtcta 60 

gaactcctgg cttcaagcag tcctcccacc taggcctctc aaagtgctga gattacgggc 120 

atgagccaca tgcctggccc gtattatttt ttagtaaaat cactttccaa aatactgcaa 180 

tatgaggaaa cctttattcc aaaaagtcta ctcataataa cttataaaca tctttggaag 240 

ttaaaaatta accacatcaa cctgcttagc ccacataacc cacattaacc cacatcaacc 300 

tgcttagccc acataatcca cattaaccca tattgtggtt tatgttttaa aaggagaaaa 360 

aacactgaaa ctaccatatg tcttaccttt taggcataca tgttaaaatt ttggcagatg 420 

aaacataata ctgatagatg acgcttcaaa ataatgcagg gaagagtaga agtgggtaga 480 

gattgttaaa tcaagtttag tctaaagcag tctccttaca tatttgaagt tcagtctaaa 540 

ggtttctctg tacatagtga actataaatg tatctaaatg gaggtgtaaa cagactgtaa 600 

cctacttttg tgccaatcac caagttttgg ccagttaaaa ggggccaact gttcaaacca 660 

tgttcaaata aggcaaatgc cgagctgtaa ccaatctgac tgtttctgta cctctgtcta 720 

tacatcttct tccaccacct ggctgtgctg gagtctctct gaacatactg tggctcagga 780 

ggctgcccta ttcacgaatc attctttgct cagttgaact ctttaatttg actaaggact 840 

ttcttttaac aagatataaa ttacacaaac gaccataaat tataattgtt ttaaaatgcc 900 

acatgggagt tcaatatatt attctctcta cttttacata tgtttgaaag ttttataaaa 960 

gagagctttt gtttttttgt tgttgttttt tctgagacag tacaatctca gctcactgtg 1020 

gcctccacct catggactaa agagatcctc ccacctcagc ctcccaagct gggactacgg 1080 

ttgtgaacca ccatgcttgc ctacttttta aattttgtgt agagatgagg tctcactgta 1140 

ttgcctaggc tggtcttgaa ctcctagtct caagcaatcc tccctcctct gtcttccaaa 1200 

gtgctgggat tacaggtgtg agccactgtg ccttgccgaa tatgggtagt tttagacatg 1260 

ctcatggcag aatgatcaac aggcggaaga agtgggaggt ccagccgatg tggatccctg 1320 

agagcccatg tccacaaaca ggggaagaat agtggctaaa ataggtcttg tagggaattt 1380 

taaagacaag tgaattgtct gttgaggcag ccaaaaaggg gctggcttct gccaggtggc 1440 

agccaggcaa tgtccaggag aggagattgc caaggaaagg aggctacaaa tgccccctcc 1500 

ttgtgatgtc aggacctccc ttagcgagcg atctggccaa gacacaggga aaagacacag 1560 

gatccagacc cggggctctg ctccttggac ggctcagtgc agagagtcac tggctgcctg 1620 

gaaggagaga gtgggcaagg gtgtgaggga atctttgggg gctgtggaag ctgttctacc 1680 

ttatgaaatg gggctgggat ggactgagtg actatgctgt gctctgtcat ttgtccgtaa 1740 
gtactctcta catgctctaa taaaacatt 1769 



<210> 15 
<211> 1094 

<212> DNA 

<213> Homo sapiens 

<400> 15 

gttcacaggg gactgttacc ttacagttgt tatcgatgaa aaatcatata aagcagacta 60 
aaaacagcaa atagcgctgc atttgatttt catcagaacc aagtggtgac tgccaaaaga 120 
aaaaaccctg tgggttttta aaagcaagga acagggctca gagtctattt tcaacatata 180 
tttgtaaatt tgattaacat acattgaatt ctgtgattaa cagtattggc aaataagttg 240 
acatataaca tgtcatttct ccccctctga agaaacgatc acaacaacaa ttcaagattt 300 
atttcccaaa gtgatgaaga aaatgagggt tcccataact ttgggctgct gcttggtttt 360 
gtttctcctt ggtctcgtct gtgtgactca ggctggaatt tactgggttc atctgattga 420 
ccacttctgt gctggatggg gcattttaat tgcagctata ctggagctag ttggaatcat 480 
ctggatttat ggagggaaca gattcattga ggatacagaa atgatgattg gagcaaagag 540 
gtggatattc tggctatggt ggagagcttg ctggtttgta attacgccta tccttttgat 600 
tgcaatattt atctggtcat tggtgcaatt tcatagacct aattatggcg caattccata 660 
ccctgactgg ggagttgctt taggctggtg tatgattgtt ttctgcatta tttggattcc 720 
aattatggct atcataaaaa taattcaggc taaaggaaac atctttcaac gccttataag 780 
ttgctgcaga ccagcttcta actggggtcc atacctggaa caacatcgtg gggaaagata 840 
taaagacatg gtagttccta aaaaagaggc tggccatgaa atacctactg ttagtggcag 900 
cagaaaaccg gaatgagatc tcattgaaaa aaatatatga ttgtataatg tgattttttt 960 
tagaataggg ggacccttat ttatttgtgt gttaactgaa taggaaaatg tacatactat 1020 
gttcatgata gggtgatttt tttcccattt aagcaggaat gcaatataaa aatgtggttt 1080 
ttttaaaaaa aaaa 1094 



<210> 16 

<211> 1663 

<212> DNA 

<213> Homo sapiens 

<400> 16 

aacatatgat acataggtac caaattacct ttttgaatta aaaaaaaata aactgaattc 60 
caaaatatat cttcttccag agatttctgg ataagtactt gtggatctct atgagcacat 120 
gatattattt ctctaaagtg aaaatcataa atcatatttt aggaatatcc tagttgtttt 180 
acagtcattt gcaggtttct ctctgtgtca gctagcctga tcttaggtct tcttttcatc 240 
caatctgctt agttaagccc atggacctaa actgatttat ttctcatctg ttcagcctgt 300 
gtcagtggaa ggtgcactag gcttttctga tcagccaagg ctaatcagag ttggagtaca 360 
gaccagaatt caacaaagga tacagtcata aattagggtt aggtttgcaa atacttttat 420 
aatgtggaaa ttagagttta aggcataact aatgaagcaa acagaaaata acatgggaaa 480 
acaaagttat caaagtcagg aaagacgaaa atcattgttc ccgtctcata aggtaggact 540 
ggagaggcaa atcaccagtg ttaaatcttt gtgcacattt taatttactt ataataaatt 600 
tctgaagttg gaattgacct gtcaaaaggc acagacattt tatagctttt gatatgcatt 660 
aaccaaattt cccaacagag atttatacca atttacattc tcccaggaat gtgtgaatgt 720 
ccctttcctc aatttttagc atttttaaat tatttgccag tttatcattt atctttttct 780 
aatttgatga atgaaaaata gtcatgtggt actttgcttt cattttttaa ttaccaatgc 840 
atattagcta tttgtatttc ttcttttagg atccagtgta ttttgaattt ctagaactta 900 
aattataact ccttatactc aaatttatac acaaaaacac tccaaagaca gatgttagta 960 
atttggctat gggcatgatt gaaaattgat ttctgaagta tacttggaaa tgtggtaata 1020 
aggttgtttt gagtggaata ttgttagaac atatttatat attataaaat attttttgga 1080 
tttcagaaga aaactttcac cttatttttt aatgttctaa gtctttactt tttaactact 1140 
acctttaaat tgagccttat ttataattgt cctatgaagt tatattgtat cattctgtgt 1200 
ttgttgcagt atcatttaat tgttttgtaa aaagctacat tgcaacacaa taaaatactt 1260 
caatgcttac aataggaagt cttgaaatag tatcctgaca tggtattaga aagtcttatc 1320 
tgcagaataa cacaaatgca caccaggaat ggggagggat gagggcggac cagagaccag 1380 
aagagctttg tttttatgag gagaaagaag ggaatcacgc tactcttgtt gactctttac 1440 
tccaagttca ttcttcattg ctaatgtctc caaatagtgc taccttagga ttgatttcca 1500 
gaatgtttct tgtttgtatt atfcagaaagt taaataagta ccattgtaat tttgaatata 1560 
ctttcaacag catggtagaa tatatgccat gtggtaatag tagtctttgt ttccatttaa 1620 
gctttggcaa atctctttta gtactaatta gtttaaaaaa aaa 1663 

<210> 17 

<211> 598 

<212> DNA 

<213> Homo sapiens 

<400> 17 

gcacgagagc aaaagcactt ttaaaatgag ttaagtggaa gacgaacaga caaatgaggt 60 
aacttctata aagatggagg ttgcttctgg ctcatcagcc atcgttgatg gtaaccagtc 120 
ctacaattct cattaggcag ttctgttacc ccactatttc cttggggtct gttgtgctaa 180 
ggacaggatt ggttgggtaa aggggtgtgg cacaaataac cctcaggaat acaggccaca 240 
gagctaatga agggccccaa ggaaagaaag acctgcccat cagcgatgaa ttctctcccc 300 
cagtgccaca gacctgaggg cacgtgaccc aggaatgtgc atccaaagat aatactacct 360 
tcagagaact ctacttatag gctgtggttt ttcaagaaaa aggaaaagat tcataattca 420 
ttgagctctt tccttgtgag aagaaaggcc actcttttgt gtgctgaagt tggacaacag 480 
ttcccaagga agctgaattc tagctgaata ttgttattgg gttttgcact atgcccttta 540 
tgttgtcatt aatcaataaa tacgtgtgga acaaatgatt aactagaaaa aaaaaaaa 598 



<210> 18 

<211> 1134 

<212> DNA 

<213> Homo sapiens 

<400> 18 

tcaaggtgtt gacagggctg tgttctcttc taaaggctcc ggggagaagc catttccttg 60 
ccctttctgg tttctagagg ttgcccctgt tccttggctt gtggcccctt cctccacctt 120 
cagtgccagc agcatcgtat ctgacccttc ttccatcatc acatctcttt ctgacccaag 180 
gaaaggttct ctgctgctgc tgagaacttg tgtgattagt ctgggcccac ccggataatc 240 
caggatcatc tctccatctg aagggccctt ttgccaacta tggtaacata gccacaggtt 300 
ccagggatta ggacgtggac atctttgggg accattattc tgtctatcac atggggatta 360 
cgacgtggac atctttgggg acattattct gtctcccaca tggggattag gacgtggaca 420 
tcttggggca cattattctg tctatcacat Sgggattagg atgtgacatc tttggggaca 480 
ttattctgtc tcccacgggg attacgacgt gagcatcttt ggggttgtct actgcccacc 540 
acgctttata agcaaagctc acccaatttc cttgttggac atggtgcttt caactcttaa 600 
ttcctgagat gcgaacctct aataatgtga ctaggaggga gaaacaggcg ggtgaggccc 660 
gtgaccgtgt aacctctccc caccctcacc gttgcaggag ggttgttcgt ggccggcatc 720 
aacctcacgg agaacctgca gtacgttctg gcgcacccgt ccgagtccct ggagaagatg 780 
acgctgccca accttccgcg gctgagcgcg tgggtccgag agcagtgccc ggggccgggt 840 
tcacggtgca ccaacatcat cgcgggggac ttcatcggcg cagacggctt cgtcagtgac 900 
gtcatcgcgc tcaatcagaa gctgctgtgg tgctgacggg acccttctga agttcgggac 960 
gcggcggctg cagtttcacc cccgaatttc caagtattgt gactttgttt gggccaaatg 1020 
ttggtgatca taggaccgat gataatacgt tttcatttct ttaaaataga gatggggtgg 1080 
ctgggcgtgg tgacttcgcc tgtcctccca gagtgctggg atgacaagcg tgag 1134 



<210> 19 
<211> 2092 
<212> DNA 

<213> Homo sapiens 
<400> 19 

tttggccggg cccggcgcct gctggcctcc gcctcgtggg taccctgcat agtgctgggg 60 
ctggtgctga gctccgagga gctgcttacc gcgcagcccg cgccccactg ccgaccggac 120 
cccacgctgt tgcccccagc gctgcgcgcc ctgcgcggac ccgcgctgct ggacgccgcc 180 
atcccgcgcc tggggcccac gcgagccgcg agcccctgcc tgctcctgcg ctaccccgat 240 
cccgcgccct gcacccgccc cggcccgcgc cccgcgcccg cacgcaacgg cacccggccc 300 
tgcacacgcg gctggctcta cgcgctgccc ggcgccggcc tcctgcaaag cccggtcacc 360 
cagtggaacc ttgtgtgtgg agacggctgg aaggtcccgc tggagcaggt gagccacctc 420 
ctgggctggc tgctgggctg tgtcatcctg ggagcaggct gtgaccggtt tggacgccgg 480 
gcagtttttg tggcctccct ggtgctgacc acaggcctgg gggccagtga ggccctggct 540 
gccagcttcc ctaccctgct ggtcctgcgc ctactccacg ggggcacatt ggcaggggcc 600 
ctcctcgccc tgtatctggc tcgcctggag ttgtgtgacc ctccccaccg cctggccttc 660 
tccatggggg ctggcctttt ctcggtggtg ggcaccctgc tgctgcccgg cctggctgcg 720 
cttgtgcagg actggcgtct tctgcagggg ctgggtgccc tgatgagtgg actcttgctg 780 
ctcttttggg ggttcccggc cctgttcccc gagtctccct gctggctgct ggccacaggt 840 
caggtagctc gagccaggaa gatcctgtgg cgctttgcag aagccagtgg cgtgggcccc 900 
ggggacagtt ccttggagga gaactccctg gctacagagc tgaccatgct gtctgcacgg 960 
agcccccagc cccggtacca ctccccactg gggcttctgc gtacccgagt cacctggaga 1020 
aacgggctta tcttgggctt cagctcgctg gttggtggag gcatcagagc tagcttccgc 1080 
cgcagcctgg cacctcaggt gccgaccttc tacctgccct acttcctgga ggccggcctg 1140 
gaggcggcag ccttggtctt cctgctcctg acggcagatt gctgtggacg ccgccccgtg 1200 
ctgctgctgg gcaccatggt cacaggcctg gcatccctgc tgctcctcgc tggggcccag 1260 
tatctgccag gctggactgt gctgttcctc tctgtcctgg ggctcctggc ctcccgggct 1320 
gtgtccgcac tcagcagcct cttcgcggcc gaggtcttcc ccacggtgat caggggggcc 1380 
gggctgggcc tggtgctggg ggccgggttc ctgggccagg cagccggccc cctggacacc 1440 
ctgcacggcc ggcagggctt cttcctgcaa caagtcgtct tcgcctccct tgctgtcctt 1500 
gccctgctgt gtgtcctgct gctgcctgag agccgaagcc gggggctgcc ccagtcactg 1560 
caggacgccg accgcctgcg ccgctcccca ctcctgcggg gccgcccccg ccaggaccac 1620 
ctgcctctgc tgccgccctc caactcctac tgggccggcc acacccccga gcagcactag 1680 
tcctgcctgg tggccctggg agccaggatg ggaccaaagt caaggcctgg ggcatggctg 1740 
agtaccccag acgtctggtc cagggcagac acattcctct cagaagcccg tgtctcagtg 1800 
caggtggagc cgtggggaca gcgtgaaggt gtctccagcc aggccccagg cactgggagg 1860 
ccctgggtct ccccccagcc acacccagta ggtgtggagg ataaaggctt ctgtggaaaa 1920 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agtatttcat tacctctttc 1980 
tccgcacctg gcctgcaggc ggccgcaggt aagccagccc aggcctcgcc ctccagctca 2040 
aggcgggaag gtgccctaga gtagcctgca tccagggaca ggccccagcc gg 2092 

<210> 20 
<211> 2371 
<212> DNA 

<213> Homo sapiens 
<400> 20 

ttctgggatg gctatggaga cccaaatgtc tcagaatgta tgtcccagaa acctgtggct 60 
gcttcaacca ttgacagttt tgctgctgct ggcttctgca gacagtcaag ctgcagctcc 120 
cccaaaggct gtgctgaaac ttgagccccc gtggatcaac gtgctccagg aggactctgt 180 
gactctgaca tgccaggggg ctcgcagccc tgagagcgac tccattcagt ggttccacaa 240 
tgggaatctc attcccaccc acacgcagcc cagctacagg ttcaaggcca acaacaatga 300 
cagcggggag tacacgtgcc agactggcca gaccagcctc agcgaccctg tgcatctgac 360 
tgtgctttcc gaatggctgg tgctccagac ccctcacctg gagttccagg agggagaaac 420 
catcatgctg aggtgccaca gctggaagga caagcctctg gtcaaggtca cattcttcca 480 
gaatggaaaa tcccagaaat tctcccgttt ggatcccacc ttctccatcc cacaagcaaa 540 
ccacagtcac agtggtgatt accactgcac aggaaacata ggctacacgc tgttctcatc 600 
caagcctgtg accatcactg tccaagtgcc cagcatgggc agctcttcac caatggggat 660 
cattgtggct gtggtcattg cgactgctgt agcagccatt gttgctgctg tagtggcctt 720 
gatctactgc aggaaaaagc ggatttcagc caattccact gatcctgtga aggctgccca 780 
atttgagcca cctggacgtc aaatgattgc catcagaaag agacaacttg aagaaaccaa 840 
caatgactat gaaacagctg acggcggcta catgactctg aaccccaggg cacctactga 900 
cgatgataaa aacatctacc tgactcttcc tcccaacgac catgtcaaca gtaataacta 960 
aagagtaacg ttatgccatg tggtcatact ctcagcttgc tagtggatga caaaaagagg 1020 
ggaattgtta aaggaaaatt taaatggaga ctggaaaaat cctgagcaaa caaaaccacc 1080 
tggcccttag aaatagcttt aactttgctt aaactacaaa cacaagcaaa acttcacggg 1140 
gtcatactac atacaagcat aagcaaaact taacttggat catttctggt aaatgcttat 1200 
gttagaaata agacaacccc agccaatcac aagcagccta ctaacatata attaggtgac 1260 
tagggacttt ctaagaagat acctaccccc aaaaaacaat tatgtaattg aaaaccaacc 1320 
gattgccttt attttgcttc cacattttcc caataaatac ttgcctgtga cattttgcca 1380 
ctggaacact aaacttcatg aattgcgcct cagatttttg ctttaacatc tttttttttt 1440 
tttgacagag tctcaatctg ttacccaggc tggagtgcag tggtgctatc ttggctcact 1500 
gcaaacccgc ctcccaggtt taagcgattc tcatgcctca gcctcccagt agctgggatt 1560 
agaggcatgt gcatcatacc cagctaattt ttgtattttt tattttttat ttttagtaga 1620 
gacagggttt cgcaatgttg gccaggcgat ctcgaacttc tggcctctag cgatctgccg 1680 
cctcggcctc ccaaagtgct gggatgacca gcatcagccc caatgtccag cctctttaac 1740 
atcttctttc ctatgccctc tctgtggatc cctactgctg gtttctgcct tctccatgct 1800 
gagaacaaaa tcacctattc actgcttatg cagtcggaag ctccagaaga acaaagagcc 1860 
caattaccag aaccacatta agtctccatt gttttgcctt gggatttgag aagagaatta 1920 
gagaggtgag gatctggtat ttcctggact aaattcccct tggaagacga agggatgctg 1980 
cagttccaaa agagaaggac tcttccagag tcatctacct gagtcccaaa gctccctgtc 2040 
ctgaaagcac agacaatatg gtcccaaatg actgactgca ccttctgtgc ctcagccgtt 2100 
cttgacatca agaatcttct gttccacatc cacacagcca atacaattag tcaaaccact 2160 
gttattaaca gatgtagcaa catgagaaac gcttatgtta caggttacat gagagcaatc 2220 
atgtaagtct atatgacttc agaaatgtta aaatagacta acctctaaca acaaattaaa 2280 
agtgattgtt tcaaggtgat gcaattattg atgacctatt ctatttgtct ataatgatca 2340 
tatattacct ttgtaataaa acattataat c 2371 



<210> 21 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 21 

cttggtcttc ctgctcctga c 21 



<210> 22 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 22 

agggcagaga ggaacagca 19 



<210> 23 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 23 

ccagcgagga gcagcaggga tg 22 



<210> 24 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 24 

gcctgtttgg gagattagat ttt 23 



<210> 25 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 25 

gcccaaacag aacagactaa aaa 23 

<210> 26 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 26 

aggttattag gttattatct ctctctcctg atttttcc 38 

<210> 27 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 27 

tggtggcgtt cctcctgtc 19 

<210> 28 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 28 

cagagccctt cgtactggaa cac 

<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 29 

tcgtacaggt cctgggtgct ccaca 

<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 30 

cttggcagct cacatggaac 

<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 31 

ctggggtgtc tctgtcactc tc 

<210> 32 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 32 

ccatgaagtc ccaccccttt tctctg 26 



<210> 33 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 33 

tgcagcagaa aggggagag 19 

<210> 34 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 34 

tccccattgc cctcaagt 18 

<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 35 

cgtgggcact cacctcggca ct 22 

<210> 36 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 36 

caggctcatt ttattgtggt cat 

<210> 37 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 37 

cccacactga tttaggcaca tag 

<210> 38 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 38 

tttgaaggag ggcaggaaaa actatgtaag 

<210> 39 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 39 

agggagagga gctatggacg t 

<210> 40 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: Synthetic 
<400> 40 

ttttgaggca agactccatc tc 22 



<210> 41 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 41 

ctgccaaggg agagagtgag gtaggc 26 



<210> 42 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 42 

tggggacaat atggacctca 20 



<210> 43 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 43 

ggcgagtgtc tatgatgaac ct 22 



<210> 44 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 44 

caggatctgt gaggatttca tttggataca t 

<210> 45 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 45 

ctccgtggct cgtgctt 

<210> 46 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 46 

cgctttcttt ttgccctctt gt 



<210> 47 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 47 

ccgaccttga gattattcct gt 

<210> 48 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 48 

gcaccactta aaccaaatcc a 

<210> 49 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 49 

tgctgccaac accacttctc catct 

<210> 50 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 50 

tgctgccaca aaccgaga 

<210> 51 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 51 

ttgggagggt tggttggtt 

<210> 52 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: Synthetic 
<400> 52 

ttttgagggc actagggaac gatctgt 27 



<210> 53 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 53 

cctgatacct ttaaccaatg ctct 24 



<210> 54 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 54 

ttgggtagta tcaaatgggt aagg 24 

<210> 55 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 55 

cctgtccttc tcctttggct tatgctatcc 30 

<210> 56 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 56 

ctcggatatg attaaagagt ttcg 24 



<210> 57 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 57 

tccactgtgc tgtttgttgt t 21 

<210> 58 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 58 

attggcgtgc tctttgtaac tctgaga 27 

<210> 59 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 59 

tggctaaaat aggtcttgta ggga 24 

<210> 60 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 60 

caaggagggg gcatttgta 19 



<210> 61 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 61 

tcctttcctt ggcaatctcc tctcctg 27 

<210> 62 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 62 

cctctgaaga aacgatcaca aca 23 

<210> 63 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 63 

attccagcct gagtcacaca ga 22 

<210> 64 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 64 

accaaggaga aacaaaacca agcagca 27 



<210> 65 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 65 

tgaggagaaa gaagggaatc ac 22 

<210> 66 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 66 

tcctaaggta gcactatttg gagac 25 

<210> 67 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 67 

agcaatgaag aatgaacttg gagtaaagag tca 33 

<210> 68 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 



WO 02/18576 

<223> Description of Artificial Sequence: Synthetic 
<400> 68 

atgggcaggt ctttctttcc 

<210> 69 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 69 

aggcagttct gttaccccac ta 

<210> 70 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 70 

tgtgctaagg acaggattgg ttgggta 

<210> 71 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 71 

actgcccacc acgctttata 

<210> 72 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 




<223> Description of Artificial Sequence: Synthetic 



PCT/US01/26684 



<400> 72 

tgagggtggg gagaggttac 20 

<210> 73 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 73 

agtcacatta ttagaggttc gcatctcagg 30 

<210> 74 
<211> 2722 
<212> DNA 

<213> Homo sapiens 
<400> 74 

gtttggatct tggttcattc tcaagcctca gacagtggtt caaagttttt ttcttccatt 60 
tcaggtgtcg tgaaaagctt gaattcggcg cgccagatat cacacgtgcc aaggggctgg 120 
ctcagcgaca gcggcgactg cggcggccgc gggagggcat cccgttgggg atccttccgc 180 
acactgaaga gtacgtcttc gggtctaccc ctaatcacat aatggctgtg tttaatcaga 240 
agtctgtctc ggatatgatt aaagagtttc gaaaaaattg gcgtgctctt tgtaactctg 300 
agagaactac tctatgtggt gcagactcca tgctcttggc attgcagctt tctatggcgg 360 
agaacaacaa acagcacagt ggagaattta cagtctctct cagtgatgtt ttattgacat 420 
ggaaatactt gctccatgag aaattgaact taccagttga aaacatggac gtgactgacc 480 
attatgagga cgttaggaag atttatgatg atttcttgaa gaacagtaat atgttagatc 540 
tgattgatgt ttatcaaaaa tgtagggctt tgacttctaa ttgtgaaaat tataacacag 600 
tatctcctag tcaactactg gattttctgt ctggcaaaca gtatgcagta ggtgatgaaa 660 
ctgatctttc tataccaaca tcaccaacaa gtaaatacaa ccgtgataat gaaaaggtgc 720 
agctgctagc aaggaaaatt atcttttcat atttaaatct gctagtgaat tcaaagaatg 780 
acctggctgt ggcttatatt ctcaatattc ctgatagagg actaggaaga gaagccttca 840 
ctgatttgaa acatgctgct cgagagaaac aaatgtctat ctttttggtg gccacgtctt 900 
ttattagaac aatagagctt ggagggaaag gatatgcacc accaccatca gatcctttaa 960 
ggacacatgt aaagggattg tctaatttta ttaatttcat tgacaaatta gatgagattc 1020 
ttggagaaat accaaaccca agagggtgta aatccatctg ttggaagatc aacaattgga 1080 
acgagttttg gaaatgttca tctggacaga agtaaaaatg aaaaagtatc aagaaaatca 1140 
accagtcaga caggaaataa aagctcaaaa aggaaacagg tggatttgga tggtgaaaat 1200 
attctctgtg ataatagaaa tgaaccacct caacataaaa atgctaaaat acctaagaaa 1260 
tcaaatgatt cacagaatag attgtacggc aaactagcta aagtagcaaa aagtaataaa 1320 
tgtactgcca aggacaagtt gatttctggc caggcaaagt taactcagtt ttttagacta 1380 
taaatttgtg tcttatatgc tttaggttta tgtatctata aaccattcac caaagacatg 1440 
cttaattttt aagagatcaa ggtgtaaatt atgatgattt attattttgg tctacagtgt 1500 
atgtaaggtt agtatgttaa gcattgttta aaaatactag taagtcataa ttatgcagaa 1560 
ttttcacaaa gtttaatgca cagagaaagc atatcatttc agttactgat acatcttaac 1620 
actactttct tttaaaacag acatttaaca tacacaagtt atagtagcag tatgggcttc 1680 
tcctcccatt ggcaattaaa tgcttttatt ttcttctgaa aagatgatgt ggaccaacag 1740 
gtatcagact tgccaacaag gtcggtagac tcttcccagc atacatctga gcactgaagg 1800 
aagaagaaag tttaaattgt ttaaaggact ataattatca cacaaaattt attaagaaaa 1860 
aaagaatgga tctagtataa ctaattctga gtaaaccaaa atgataataa ttaattgttg 1920 
ctatttaatc ccacattttt ggcaggtgta attgagccat ggtcttattt gattttgtta 1980 
tgattgcatc caaattcact ttaactcaga gttctgttta atggtggtag gatgtaagaa 2040 
ttgaatttga aaagactact cactgtcaaa atctctcctt cctataggaa atttagctga 2100 
gttttcttca tccccaattt ctctcttttc ttgtgttgat tcagtattct gaactccatt 2160 
ctcagctggg aaagctacag atccttttag tgcaagataa ggttttatag ccagattcag 2220 
tggcagacca tgatttaaga aattatgttt ggagcctgtg ttctgtaaag agaaggttga 2280 
tttggttttt agctatcgta ttcggagtgg aactataata caattgtata atattcttgt 2340 
tgatcaattc aaagttactc tgcactgttt ttgacttttt aaaaatacct tagatgcaaa 2400 
tttataggag aaaaaacact ttcagataag aggtgtttgc tgggatggaa gaactacctg 2460 
gcatgtaaga aatatcgtca gtcgtcctaa tgcatattgt gactgtttgc atatacttct 2520 
gtttataaaa gtatcagttt tacttttcag aggatttgta agaatcattt aaattttcat 2580 
tgaaataaac gacaagtcac attgccaaaa aaaaaaaaaa aaaaaaaagt atttcattac 2640 
ctctttctcc gcacctggcc tgcaggcggc cgcaggtaag ccagcccagg cctcgccctc 2700 
cagctcaggc gggacaggag cg 2722 



<210> 75 

<211> 64 

<212> PRT 

<213> Homo sapiens 

<400> 75 

Val Leu Asn Ala Phe Leu Gln Pro Pro Gly Arg Gln Met Ile Ala Ile 
1 5 10 15 

Arg Lys Arg Gln Pro Glu Glu Thr Asn Asn Asp Tyr Glu Thr Ala Asp 
20 25 30 

Gly Gly Tyr Met Thr Leu Asn Pro Arg Ala Pro Thr Asp Asp Asp Lys 
35 40 45 

Asn Ile Tyr Leu Thr Leu Pro Pro Asn Asp His Val Asn Ser Asn Asn 
50 55 60 



<210> 76 
<211> 261 
<212> PRT 

<213> Homo sapiens 

<400> 76 

Met Ser Thr Thr Thr Cys Gln Val Val Ala Phe Leu Leu Ser Ile Leu 
1 5 10 15 

Gly Leu Ala Gly Cys Ile Ala Ala Thr Gly Met Asp Met Trp Ser Thr 
20 25 30 

Gln Asp Leu Tyr Asp Asn Pro Val Thr Ser Val Phe Gln Tyr Glu Gly 
35 40 45 

Leu Trp Arg Ser Cys Val Arg Gln Ser Ser Gly Phe Thr Glu Cys Arg 
50 55 60 

Pro Tyr Phe Thr Ile Leu Gly Leu Pro Ala Met Leu Gln Ala Val Arg 
65 70 75 80 

Ala Leu Met Ile Val Gly Ile Val Leu Gly Ala Ile Gly Leu Leu Val 
85 90 95 

Ser Ile Phe Ala Leu Lys Cys Ile Arg Ile Gly Ser Met Glu Asp Ser 
100 105 110 

Ala Lys Ala Asn Met Thr Leu Thr Ser Gly Ile Met Phe Ile Val Ser 
115 120 125 

Gly Leu Cys Ala Ile Ala Gly Val Ser Val Phe Ala Asn Met Leu Val 
130 135 140 

Thr Asn Phe Trp Met Ser Thr Ala Asn Met Tyr Thr Gly Met Gly Gly 
145 150 155 160 

Met Val Gln Thr Val Gln Thr Arg Tyr Thr Phe Gly Ala Ala Leu Phe 
165 170 175 

Val Gly Trp Val Ala Gly Gly Leu Thr Leu Ile Gly Gly Val Met Met 
180 185 190 

Cys Ile Ala Cys Arg Gly Leu Ala Pro Glu Glu Thr Asn Tyr Lys Ala 
195 200 205 

Val Ser Tyr His Ala Ser Gly His Ser Val Ala Tyr Lys Pro Gly Gly 
210 215 220 

Phe Lys Ala Ser Thr Gly Phe Gly Ser Asn Thr Lys Asn Lys Lys Ile 
225 230 235 240 

Tyr Asp Gly Gly Ala Arg Thr Glu Asp Glu Val Gln Ser Tyr Pro Ser 
245 250 255 

Lys His Asp Tyr Val 
260 



<210> 77 

<211> 1461 

<212> PRT 

<213> Homo sapiens 

<400> 77 

Met Glu Ala Arg Ser Arg Ser Ala Glu Glu Leu Arg Arg Ala Glu Leu 
1 5 10 15 

Val Glu Ile Ile Val Glu Thr Glu Ala Gln Thr Gly Val Ser Gly Ile 
20 25 30 

Asn Val Ala Gly Gly Gly Lys Glu Gly Ile Phe Val Arg Glu Leu Arg 
35 40 45 

Glu Asp Ser Pro Ala Ala Arg Ser Leu Ser Leu Gln Glu Gly Asp Gln 
50 55 60 

Leu Leu Ser Ala Arg Val Phe Phe Glu Asn Phe Lys Tyr Glu Asp Ala 
65 70 75 80 

Leu Arg Leu Leu Gln Cys Ala Glu Pro Tyr Lys Val Ser Phe Cys Leu 
85 90 95 

Lys Arg Thr Val Pro Thr Gly Asp Leu Ala Leu Arg Pro Gly Thr Val 
100 105 110 

Ser Gly Tyr Glu Ile Lys Gly Pro Arg Ala Lys Val Ala Lys Leu Asn 
115 120 125 

Ile Gln Ser Leu Ser Pro Val Lys Lys Lys Lys Met Val Pro Gly Ala 
130 135 140 

Leu Gly Val Pro Ala Asp Leu Ala Pro Val Asp Val Glu Phe Ser Phe 
145 150 155 160 

Pro Lys Phe Ser Arg Leu Arg Arg Gly Leu Lys Ala Glu Ala Val Lys 
165 170 175 

Gly Pro Val Pro Ala Ala Pro Ala Arg Arg Arg Leu Gln Leu Pro Arg 
180 185 190 

Leu Arg Val Arg Glu Val Ala Glu Glu Ala Gln Ala Ala Arg Leu Ala 
195 200 205 

Ala Ala Ala Pro Pro Pro Arg Lys Ala Lys Val Glu Ala Glu Val Ala 
210 215 220 

Ala Gly Ala Arg Phe Thr Ala Pro Gln Val Glu Leu Val Gly Pro Arg 
225 230 235 240 

Leu Pro Gly Ala Glu Val Gly Val Pro Gln Val Ser Ala Pro Lys Ala 
245 250 255 

Ala Pro Ser Ala Glu Ala Ala Gly Gly Phe Ala Leu His Leu Pro Thr 
260 265 270 

Leu Gly Leu Gly Ala Pro Ala Pro Pro Ala Val Glu Ala Pro Ala Val 
275 280 285 

Gly Ile Gln Val Pro Gln Val Glu Leu Pro Ala Leu Pro Ser Leu Pro 
290 295 300 

Thr Leu Pro Thr Leu Pro Cys Leu Glu Thr Arg Glu Gly Ala Val Ser 
305 310 315 320 

Val Val Val Pro Thr Leu Asp Val Ala Ala Pro Thr Val Gly Val Asp 
325 330 335 

Leu Ala Leu Pro Gly Ala Glu Val Glu Ala Arg Gly Glu Ala Pro Glu 
340 345 350 

Val Ala Leu Lys Met Pro Arg Leu Ser Phe Pro Arg Phe Gly Ala Arg 
355 360 365 

Ala Lys Glu Val Ala Glu Ala Lys Val Ala Lys Val Ser Pro Glu Ala 
370 375 380 

Arg Val Lys Gly Pro Arg Leu Arg Met Pro Thr Phe Gly Leu Ser Leu 
385 390 395 400 

Leu Glu Pro Arg Pro Ala Ala Pro Glu Val Val Glu Ser Lys Leu Lys 
405 410 415 

Leu Pro Thr Ile Lys Met Pro Ser Leu Gly Ile Gly Val Ser Gly Pro 
420 425 430 

Glu Val Lys Val Pro Lys Gly Pro Glu Val Lys Leu Pro Lys Ala Pro 
435 440 445 

Glu Val Lys Leu Pro Lys Val Pro Glu Ala Ala Leu Pro Glu Val Arg 
450 455 460 

Leu Pro Glu Val Glu Leu Pro Lys Val Ser Glu Met Lys Leu Pro Lys 
465 470 475 480 

Val Pro Glu Met Ala Val Pro Glu Val Arg Leu Pro Glu Val Glu Leu 
485 490 495 

Pro Lys Val Ser Glu Met Lys Leu Pro Lys Val Pro Glu Met Ala Val 
500 505 510 

Pro Glu Val Arg Leu Pro Glu Val Gln Leu Leu Lys Val Ser Glu Met 
515 520 525 

Lys Leu Pro Lys Val Pro Glu Met Ala Val Pro Glu Val Arg Leu Pro 
530 535 540 

Glu Val Gln Leu Pro Lys Val Ser Glu Met Lys Leu Pro Glu Val Ser 
545 550 555 560 

Glu Val Ala Val Pro Glu Val Arg Leu Pro Glu Val Gln Leu Pro Lys 
565 570 575 

Val Pro Glu Met Lys Val Pro Glu Met Lys Leu Pro Lys Val Pro Glu 
580 585 590 

Met Lys Leu Pro Glu Met Lys Leu Pro Glu Val Gln Leu Pro Lys Val 
595 600 605 

Pro Glu Met Ala Val Pro Asp Val His Leu Pro Glu Val Gln Leu Pro 
610 615 620 

Lys Val Pro Glu Met Lys Leu Pro Glu Met Lys Leu Pro Glu Val Lys 
625 630 635 640 

Leu Pro Lys Val Pro Glu Met Ala Val Pro Asp Val His Leu Pro Glu 
645 650 655 

Val Gln Leu Pro Lys Val Pro Glu Met Lys Leu Pro Lys Met Pro Glu 
660 665 670 

Met Ala Val Pro Glu Val Arg Leu Pro Glu Val Gln Leu Pro Lys Val 
675 680 685 

Ser Glu Met Lys Leu Pro Lys Val Pro Glu Met Ala Val Pro Asp Val 
690 695 700 

His Leu Pro Glu Val Gln Leu Pro Lys Val Cys Glu Met Lys Val Pro 
705 710 715 720 

Asp Met Lys Leu Pro Glu Ile Lys Leu Pro Lys Val Pro Glu Met Ala 
725 730 735 

Val Pro Asp Val His Leu Pro Glu Val Gln Leu Pro Lys Val Ser Glu 
740 745 750 

Ile Arg Leu Pro Glu Met Gln Val Pro Lys Val Pro Asp Val His Leu 
755 760 765 

Pro Lys Ala Pro Glu Val Lys Leu Pro Arg Ala Pro Glu Val Gln Leu 
770 775 780 

Lys Ala Thr Lys Ala Glu Gln Ala Glu Gly Met Glu Phe Gly Phe Lys 
785 790 795 800 

Met Pro Lys Met Thr Met Pro Lys Leu Gly Arg Ala Glu Ser Pro Ser 
805 810 815 

Arg Gly Lys Pro Gly Glu Ala Gly Ala Glu Val Ser Gly Lys Leu Val 
820 825 830 

Thr Leu Pro Cys Leu Gln Pro Glu Val Asp Gly Glu Ala His Val Gly 
835 840 845 

Val Pro Ser Leu Thr Leu Pro Ser Val Glu Leu Asp Leu Pro Gly Ala 
850 855 860 

Leu Gly Leu Gln Gly Gln Val Pro Ala Ala Lys Met Gly Lys Gly Glu 
865 870 875 880 

Arg Ala Glu Gly Pro Glu Val Ala Ala Gly Val Arg Glu Val Gly Phe 
885 890 895 

Arg Val Pro Ser Val Glu Ile Val Thr Pro Gln Leu Pro Ala Val Glu 
900 905 910 

Ile Glu Glu Gly Arg Leu Glu Met Ile Glu Thr Lys Val Lys Pro Ser 
915 920 925 

Ser Lys Phe Ser Leu Pro Lys Phe Gly Leu Ser Gly Pro Lys Val Ala 
930 935 940 



Lys Ala Glu Ala Glu Gly Ala Gly Arg Ala Thr Lys Leu Lys Val Ser 
945 950 955 960 

Lys Phe Ala Ile Ser Leu Pro Lys Ala Arg Val Gly Ala Glu Ala Glu 
965 970 975 

Ala Lys Gly Ala Gly Glu Ala Gly Leu Leu Pro Ala Leu Asp Leu Ser 
980 985 990 

Ile Pro Gln Leu Ser Leu Asp Ala His Leu Pro Ser Gly Lys Val Glu 
995 1000 1005 

Val Ala Gly Ala Asp Leu Lys Phe Lys Gly Pro Arg Phe Ala Leu Pro 
1010 1015 1020 

Lys Phe Gly Val Arg Gly Arg Asp Thr Glu Ala Ala Glu Leu Val Pro 
1025 1030 1035 1040 

Gly Val Ala Glu Leu Glu Gly Lys Gly Trp Gly Trp Asp Gly Arg Val 
1045 1050 1055 

Lys Met Pro Lys Leu Lys Met Pro Ser Phe Gly Leu Ala Arg Gly Lys 
1060 1065 1070 

Glu Ala Glu Val Gln Gly Asp Arg Ala Ser Pro Gly Glu Lys Ala Glu 
1075 1080 1085 

Ser Thr Ala Val Gln Leu Lys Ile Pro Glu Val Glu Leu Val Thr Leu 
1090 1095 1100 

Gly Ala Gln Glu Glu Gly Arg Ala Glu Gly Ala Val Ala Val Ser Gly 
1105 1110 1115 1120 

Met Gln Leu Ser Gly Leu Lys Val Ser Thr Ala Arg Gln Val Val Thr 
1125 1130 1135 

Glu Gly His Asp Ala Gly Leu Arg Met Pro Pro Leu Gly Ile Ser Leu 
1140 1145 1150 

Pro Gln Val Glu Leu Thr Gly Phe Gly Glu Ala Gly Thr Pro Gly Gln 
1155 1160 1165 

Gln Ala Gln Ser Thr Val Pro Ser Ala Glu Gly Thr Ala Gly Tyr Arg 
1170 1175 1180 

Val Gln Val Pro Gln Val Thr Leu Ser Leu Pro Gly Ala Gln Val Ala 
1185 1190 1195 1200 

Gly Gly Glu Leu Leu Val Gly Glu Gly Val Phe Lys Met Pro Thr Val 
1205 1210 1215 

Thr Val Pro Gln Leu Glu Leu Asp Val Gly Leu Ser Arg Glu Ala Gln 
1220 1225 1230 

Ala Gly Glu Ala Ala Thr Gly Glu Gly Gly Leu Arg Leu Lys Leu Pro 
1235 1240 1245 

Thr Leu Gly Ala Arg Ala Arg Val Gly Gly Glu Gly Ala Glu Glu Gln 
1250 1255 1260 

Pro Pro Gly Ala Glu Arg Thr Phe Cys Leu Ser Leu Pro Asp Val Glu 
1265 1270 1275 1280 

Leu Ser Pro Ser Gly Gly Asn His Ala Glu Tyr Gln Val Ala Glu Gly 
1285 1290 1295 

Glu Gly Glu Ala Gly His Lys Leu Lys Val Arg Leu Pro Arg Phe Gly 
1300 1305 1310 

Leu Val Arg Ala Lys Glu Gly Ala Glu Glu Gly Glu Lys Ala Lys Ser 
1315 1320 1325 

Pro Lys Leu Arg Leu Pro Arg Val Gly Phe Ser Gln Ser Glu Met Val 
1330 1335 1340 

Thr Gly Glu Gly Ser Pro Ser Pro Glu Glu Glu Glu Glu Glu Glu Glu 
1345 1350 1355 1360 

Glu Gly Ser Gly Glu Gly Ala Ser Gly Arg Arg Gly Arg Val Arg Val 
1365 1370 1375 

Arg Leu Pro Arg Val Gly Leu Ala Ala Pro Ser Lys Ala Ser Arg Gly 
1380 1385 1390 

Gln Glu Gly Asp Ala Ala Pro Lys Ser Pro Val Arg Glu Lys Ser Pro 
1395 1400 1405 

Lys Phe Arg Phe Pro Arg Val Ser Leu Ser Pro Lys Ala Arg Ser Gly 
1410 1415 1420 

Ser Gly Asp Gln Glu Glu Gly Gly Leu Arg Val Arg Leu Pro Ser Val 
1425 1430 1435 1440 

Gly Phe Ser Glu Thr Gly Ala Pro Gly Pro Ala Arg Met Glu Gly Ala 
1445 1450 1455 

Gln Ala Ala Ala Val 
1460 



<210> 78 

<211> 879 

<212> PRT 

<213> Homo sapiens 

<400> 78 

Arg Glu Leu Trp Thr Phe Ala Gly Ser Arg Asp Pro Ser Ala Pro Arg 
1 5 10 15 

Leu Ala Tyr Gly Tyr Gly Pro Gly Ser Leu Arg Glu Leu Arg Ala Arg 
20 25 30 

Glu Phe Ser Arg Leu Ala Gly Thr Val Tyr Leu Asp His Ala Gly Ala 
35 40 45 

Thr Leu Phe Ser Gln Ser Gln Leu Glu Ser Phe Thr Ser Asp Leu Met 
50 55 60 

Glu Asn Thr Tyr Gly Asn Pro His Ser Gln Asn Ile Ser Ser Lys Leu 
65 70 75 80 

Thr His Asp Thr Val Glu Gln Val Arg Tyr Arg Ile Leu Ala His Phe 
85 90 95 

His Thr Thr Ala Glu Asp Tyr Thr Val Ile Phe Thr Ala Gly Ser Thr 
100 105 110 

Ala Ala Leu Lys Leu Val Ala Glu Ala Phe Pro Trp Val Ser Gln Gly 
115 120 125 

Pro Glu Ser Ser Gly Ser Arg Phe Cys Tyr Leu Thr Asp Ser His Thr 
130 135 140 

Ser Val Val Gly Met Arg Asn Val Thr Met Ala Ile Asn Val Ile Ser 
145 150 155 160 

Ile Pro Val Arg Pro Glu Asp Leu Trp Ser Ala Glu Glu Arg Gly Ala 
165 170 175 

Ser Ala Ser Asn Pro Asp Cys Gln Leu Pro His Leu Phe Cys Tyr Pro 
180 185 190 

Ala Gln Ser Asn Phe Ser Gly Val Arg Tyr Pro Leu Ser Trp Ile Glu 
195 200 205 

Glu Val Lys Ser Gly Arg Leu Arg Pro Val Ser Thr Pro Gly Lys Trp 
210 215 220 

Phe Val Leu Leu Asp Ala Ala Ser Tyr Val Ser Thr Ser Pro Leu Asp 
225 230 235 240 

Leu Ser Ala His Gln Ala Asp Phe Val Pro Ile Ser Phe Tyr Lys Ile 
245 250 255 

Phe Gly Phe Arg Thr Gly Leu Gly Ala Leu Trp Val His Asn Arg Ala 
260 265 270 

Ala Pro Leu Leu Arg Lys Thr Tyr Phe Gly Gly Gly Thr Ala Ser Ala 
275 280 285 

Tyr Leu Ala Gly Glu Asp Phe Tyr Ile Pro Arg Gln Ser Val Ala Gln 
290 295 300 

Arg Phe Glu Asp Gly Thr Ile Ser Phe Leu Asp Val Ile Ala Leu Lys 
305 310 315 320 

His Gly Phe Asp Thr Leu Glu Arg Leu Thr Gly Gly Met Glu Asn Ile 
325 330 335 

Lys Gln His Thr Phe Thr Leu Ala Gln Tyr Thr Tyr Met Ala Leu Ser 
340 345 350 

Ser Leu Gln Tyr Pro Asn Gly Ala Pro Val Val Arg Ile Tyr Ser Asp 
355 360 365 

Ser Glu Phe Ser Ser Pro Glu Val Gln Gly Pro Ile Ile Asn Phe Asn 
370 375 380 

Val Leu Asp Asp Lys Gly Asn Ile Ile Gly Tyr Ser Gln Val Asp Lys 
385 390 395 400 

Met Ala Ser Leu Tyr Asn Ile His Leu Arg Thr Gly Cys Phe Cys Asn 
405 410 415 

Thr Gly Ala Cys Gln Arg His Leu Gly Ile Ser Asn Glu Met Val Arg 
420 425 430 

Lys His Phe Gln Ala Gly His Val Cys Gly Asp Asn Met Asp Leu Ile 
435 440 445 

Asp Gly Gln Pro Thr Gly Ser Val Arg Ile Ser Phe Gly Tyr Met Ser 
450 455 460 

Thr Leu Asp Asp Val Gln Ala Phe Leu Arg Phe Ile Ile Asp Thr Arg 
465 470 475 480 

Leu His Ser Ser Gly Asp Trp Pro Val Pro Gln Ala His Ala Asp Thr 
485 490 495 

Gly Glu Thr Gly Ala Pro Ser Ala Asp Ser Gln Ala Asp Val Ile Pro 
500 505 510 

Ala Val Met Gly Arg Arg Ser Leu Ser Pro Gln Glu Asp Ala Leu Thr 
515 520 525 

Gly Ser Arg Val Trp Asn Asn Ser Ser Thr Val Asn Ala Val Pro Val 
530 535 540 

Ala Pro Pro Val Cys Asp Val Ala Arg Thr Gln Pro Thr Pro Ser Glu 
545 550 555 560 

Lys Ala Ala Gly Val Leu Glu Gly Ala Leu Gly Pro His Val Val Thr 
565 570 575 

Asn Leu Tyr Leu Tyr Pro Ile Lys Ser Cys Ala Ala Phe Glu Val Thr 
580 585 590 

Arg Trp Pro Val Gly Asn Gln Gly Leu Leu Tyr Asp Arg Ser Trp Met 
595 600 605 

Val Val Asn His Asn Gly Val Cys Leu Ser Gln Lys Gln Glu Pro Arg 
610 615 620 

Leu Cys Leu Ile Gln Pro Phe Ile Asp Leu Arg Gln Arg Ile Met Val 
625 630 635 640 

Ile Lys Ala Lys Gly Met Glu Pro Ile Glu Val Pro Leu Glu Glu Asn 
645 650 655 

Ser Glu Arg Thr Gln Ile Arg Gln Ser Arg Val Cys Ala Asp Arg Val 
660 665 670 

Ser Thr Tyr Asp Cys Gly Glu Lys Ile Ser Ser Trp Leu Ser Thr Phe 
675 680 685 

Phe Gly Arg Pro Cys His Leu Ile Lys Gln Ser Ser Asn Ser Gln Arg 
690 695 700 

Asn Ala Lys Lys Lys His Gly Lys Asp Gln Leu Pro Gly Thr Met Ala 
705 710 715 720 

Thr Leu Ser Leu Val Asn Glu Ala Gln Tyr Leu Leu Ile Asn Thr Ser 
725 730 735 

Ser Ile Leu Glu Leu His Arg Gln Leu Asn Thr Ser Asp Glu Asn Gly 
740 745 750 

Lys Glu Glu Leu Phe Ser Leu Lys Asp Leu Ser Leu Arg Phe Arg Ala 
755 760 765 

Asn Ile Ile Ile Asn Gly Lys Arg Ala Phe Glu Glu Glu Lys Trp Asp 
770 775 780 

Glu Ile Ser Ile Gly Ser Leu Arg Phe Gln Val Leu Gly Pro Cys His 
785 790 795 800 

Arg Cys Gln Met Ile Cys Ile Asp Gln Gln Thr Gly Gln Arg Asn Gln 
805 810 815 

His Val Phe Gln Lys Leu Ser Glu Ser Arg Glu Thr Lys Val Asn Phe 
820 825 830 

Gly Met Tyr Leu Met His Ala Ser Leu Asp Leu Ser Ser Pro Cys Phe 
835 840 845 

Leu Ser Val Gly Ser Gln Val Leu Pro Val Leu Lys Glu Asn Val Glu 
850 855 860 

Gly His Asp Leu Pro Ala Ser Glu Lys His Gln Asp Val Thr Ser 
865 870 875 



<210> 79 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<400> 79 

Ser Phe Phe Phe Phe Leu Arg Ala Ser Leu Thr Leu Ser Pro Arg Leu 
1 5 10 15 

Glu Cys Ser Gly Thr Ile Ala Ala His Cys Asn Pro His Leu Pro Gly 
20 25 30 

Ser Ser Asn Tyr Ala Ala Ser Ala Ser Gln Glu Ala Gly Thr Ser Gly 
35 40 45 

Met Ser His His Thr Trp Ile Ile Phe Cys Ile Phe Leu Val Glu Thr 
50 55 60 

Gly Phe His His Val Gly Gln Ala Gly Leu Glu Leu Leu Ser Ser Ser 
65 70 75 80 

Asp Ser Pro Pro Thr Leu Ala Ser Gln Ser Ala Gly Ile Thr Gly Met 
85 90 95 

Ser His His Ala Gln Pro Ala Thr Leu Ser Phe 
100 105 



<210> 80 

<211> 93 

<212> PRT 

<213> Homo sapiens 



<400> 80 

Gln Asp Arg Ile Ile Asn Leu Val Val Gly Ser Leu Thr Ser Leu Leu 
1 5 10 15 

Ile Leu Val Thr Leu Ile Ser Ala Phe Val Phe Pro Gln Leu Pro Pro 
20 25 30 

Lys Pro Leu Asn Ile Phe Phe Ala Val Cys Ile Ser Leu Ser Ser Ile 
35 40 45 

Thr Ala Cys Ile Leu Ile Tyr Trp Tyr Arg Gln Gly Asp Leu Glu Pro 
50 55 60 

Lys Phe Arg Lys Leu Ile Tyr Tyr Ile Ile Phe Ser Ile Ile Met Leu 
65 70 75 80 

Cys Ile Cys Ala Asn Leu Tyr Phe His Asp Val Gly Arg 
85 90 



<210> 81 

<211> 498 

<212> PRT 

<213> Homo sapiens 



<400> 81 

Met Asp Val Thr Asp His Tyr Glu Asp Val Arg Lys Ile Tyr Asp Asp 
1 5 10 15 

Phe Leu Lys Asn Ser Asn Met Leu Asp Leu Ile Asp Val Tyr Gln Lys 
20 25 30 

Cys Arg Ala Leu Thr Ser Asn Cys Glu Asn Tyr Asn Thr Val Ser Pro 
35 40 45 

Ser Gln Leu Leu Asp Phe Leu Ser Gly Lys Gln Tyr Ala Val Gly Asp 
50 55 60 

Glu Thr Asp Leu Ser Ile Pro Thr Ser Pro Thr Ser Lys Tyr Asn Arg 
65 70 75 80 

Asp Asn Glu Lys Val Gln Leu Leu Ala Arg Lys Ile Ile Phe Ser Tyr 
85 90 95 

Leu Asn Leu Leu Val Asn Ser Lys Asn Asp Leu Ala Val Ala Tyr Ile 
100 105 110 

Leu Asn Ile Pro Asp Arg Gly Leu Gly Arg Glu Ala Phe Thr Asp Leu 
115 120 125 

Lys His Ala Ala Arg Glu Lys Gln Met Ser Ile Phe Leu Val Ala Thr 
130 135 140 

Ser Phe Ile Arg Thr Ile Glu Leu Gly Gly Lys Gly Tyr Ala Pro Pro 
145 150 155 160 

Pro Ser Asp Pro Leu Arg Thr His Val Lys Gly Leu Ser Asn Phe Ile 
165 170 175 

Asn Phe Ile Asp Lys Leu Asp Glu Ile Leu Gly Glu Ile Pro Asn Pro 
180 185 190 

Ser Ile Ala Gly Gly Gln Ile Leu Ser Val Ile Lys Met Gln Leu Ile 
195 200 205 

Lys Gly Gln Asn Ser Arg Asp Pro Phe Cys Lys Ala Ile Glu Glu Val 
210 215 220 

Ala Gln Asp Leu Asp Leu Arg Ile Lys Asn Ile Ile Asn Ser Gln Glu 
225 230 235 240 

Gly Val Val Ala Leu Ser Thr Thr Asp Ile Ser Pro Ala Arg Pro Lys 
245 250 255 

Ser His Ala Ile Asn His Gly Thr Ala Tyr Cys Gly Arg Asp Thr Val 
260 265 270 

Lys Ala Leu Leu Val Leu Leu Asp Glu Glu Ala Ala Asn Ala Pro Thr 
275 280 285 

Lys Asn Lys Ala Glu Leu Leu Tyr Asp Glu Glu Asn Thr Ile His His 
290 295 300 

His Gly Thr Ser Ile Leu Thr Leu Phe Arg Ser Pro Thr Gln Val Asn 
305 310 315 320 

Asn Ser Ile Lys Pro Leu Arg Glu Arg Ile Cys Val Ser Met Gln Glu 
325 330 335 

Lys Lys Ile Lys Met Lys Gln Thr Leu Ile Arg Ser Gln Phe Ala Cys 
340 345 350 

Thr Tyr Lys Asp Asp Tyr Met Ile Ser Lys Asp Asn Trp Asn Asn Val 
355 360 365 

Asn Leu Ala Ser Lys Pro Leu Cys Val Leu Tyr Met Glu Asn Asp Leu 
370 375 380 

Ser Glu Gly Val Asn Pro Ser Val Gly Arg Ser Thr Ile Gly Thr Ser 
385 390 395 400 

Phe Gly Asn Val His Leu Asp Arg Ser Lys Asn Glu Lys Val Ser Arg 
405 410 415 

Lys Ser Thr Ser Gln Thr Gly Asn Lys Ser Ser Lys Arg Lys Gln Val 
420 425 430 

Asp Leu Asp Gly Glu Asn Ile Leu Cys Asp Asn Arg Asn Glu Pro Pro 
435 440 445 

Gln His Lys Asn Ala Lys Ile Pro Lys Lys Ser Asn Asp Ser Gln Asn 
450 455 460 

Arg Leu Tyr Gly Lys Leu Ala Lys Val Ala Lys Ser Asn Lys Cys Thr 
465 470 475 480 

Ala Lys Asp Lys Leu Ile Ser Gly Gln Ala Lys Leu Thr Gln Phe Phe 
485 490 495 

Arg Leu 



<210> 82 
<211> 104 
<212> PRT 

<213> Homo sapiens 
<400> 82 

Phe Tyr Lys Arg Glu Leu Leu Phe Phe Cys Cys Cys Phe Phe Ala Asp 
1 5 10 15 

Ser Thr Ile Ser Ala His Cys Gly Leu His Leu Met Asp Ala Arg Asp 
20 25 30 

Pro Pro Thr Ser Ala Ser Gln Ala Gly Thr Thr Val Val Asn His His 
35 40 45 

Ala Cys Leu Leu Phe Lys Phe Cys Val Glu Met Arg Ser His Cys Ile 
50 55 60 

Ala Ala Ala Gly Leu Glu Leu Leu Val Ser Ser Asn Pro Pro Ser Ser 
65 70 75 80 

Val Phe Gln Ser Ala Gly Ile Thr Gly Val Ser His Cys Ala Leu Pro 
85 90 95 

Asn Met Gly Ser Phe Arg His Ala 
100 



<210> 83 
<211> 216 
<212> PRT 

<213> Homo sapiens 
<400> 83 

Ser Glu Glu Thr Ile Thr Thr Thr Ile Gln Asp Leu Phe Pro Lys Val 
1 5 10 15 

Met Lys Lys Met Arg Val Pro Ile Thr Leu Gly Cys Cys Leu Val Leu 
20 25 30 

Phe Leu Leu Gly Leu Val Cys Val Thr Gln Ala Gly Ile Tyr Trp Val 
35 40 45 

His Leu Ile Asp His Phe Cys Ala Gly Trp Gly Ile Leu Ile Ala Ala 
50 55 60 

Ile Leu Glu Leu Val Gly Ile Ile Trp Ile Tyr Gly Gly Asn Arg Phe 
65 70 75 80 

Ile Glu Asp Thr Glu Met Met Ile Gly Ala Lys Arg Trp Ile Phe Trp 
85 90 95 

Leu Trp Trp Arg Ala Cys Trp Phe Val Ile Thr Pro Ile Leu Leu Ile 
100 105 110 

Ala Ile Phe Ile Trp Ser Leu Val Gln Phe His Arg Pro Asn Tyr Gly 
115 120 125 

Ala Ile Pro Tyr Pro Asp Trp Gly Val Ala Leu Gly Trp Cys Met Ile 
130 135 140 

Val Phe Cys Ile Ile Trp Ile Pro Ile Met Ala Ile Ile Lys Ile Ile 
145 150 155 160 

Gln Ala Lys Gly Asn Ile Phe Gln Arg Leu Ile Ser Cys Cys Arg Pro 
165 170 175 

Ala Ser Asn Trp Gly Pro Tyr Leu Glu Gln His Arg Gly Glu Arg Tyr 
180 185 190 

Lys Asp Met Val Val Pro Lys Lys Glu Ala Gly His Glu Ile Pro Thr 
195 200 205 



Val Ser Gly Ser Arg Lys Pro Glu 
210 215 



<210> 84 
<211> 79 
<212> PRT 

<213> Homo sapiens 
<400> 84 

Gly Gly Leu Phe Val Ala Gly Ile Asn Leu Thr Glu Asn Leu Gln Tyr 
1 5 10 15 

Val Leu Ala His Pro Ser Glu Ser Leu Glu Lys Met Thr Leu Pro Asn 
20 25 30 

Leu Pro Arg Leu Ser Ala Trp Val Arg Glu Gln Cys Pro Gly Pro Gly 
35 40 45 

Ser Arg Cys Thr Asn Ile Ile Ala Gly Asp Phe Ile Gly Ala Asp Gly 
50 55 60 

Phe Val Ser Asp Val Ile Ala Leu Asn Gln Lys Leu Leu Trp Cys 
65 70 75 